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PREFACE 



This book contains the contributions of 1 8 experts at the 1 3th International 
Workshop on Advances in Analog Circuit Design (AACD 2004) in Montreux, 
Switzerland, April 6-8, 2004. 

The 18 tutorials are devided in 3 parts according to the 3 topics: 

- Sensor and Actuator Interface Electronics 

- Integrated High-Voltage Electronics and Power Management 

- Low-Power and High-Resolution ADC’s. 

The program com m ittee consists of: 

- Johan H. Huijsing, Delft University of Technology, The Netherlands 

- Michiel S.J. Steyaert, Katholieke Universiteit Leuven, Belgium 

- Arthur H.M. van Roermund, Eindhoven University of Technology, The 
Netherlands. 

The AACD 2004 organisation was in the hands of: 

- Vlado Valence, Mead Education S.A., 

- Christian Enz, CSEM S.A., and 

- Michiel Declerq, EPFL. 

This book is number 13 in the successful series on Analog Circuit Design. The 
topics covered in previous issues are: 

AACD 2003 Graz (Austria) 

• Fractional-N Synthesis 

• Design for Robustness 

• Line and bus Drivers 

AACD 2002 Spa (Belgium) 

• Structured Mixed-Mode Design 

• Multi-Bit Sigma-Delta Converters 

• Short-Range RF Circuits 

AACD 2001 Noordwijk (The Netherlands) 

• Scalable Analog Circuits 

• High-Speed D/A Converters 

• RF Power Amplifiers 

AACD 2000 Munich (Germany) 

• High-Speed A/D Converters 
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viii 



• Mixed-Signal Design 

• PLLs and Synthesizers 

AACD 1999 Nice (France) 

• XDSL and Other Communication Systems 

• RF-MOST Models and Behavioural Modelling 

• Integrated Filters and Oscillators 

AACD 1998 Copenhagen (Denmark) 

• 1 - Volt Electronics 

• Mixed-Mode Systems 

• LNAS and RF Power Amps for Telecom 

AACD 1997 Como (Italy) 

• RF A/D Converters 

• Sensor and Actuator Interfaces 

• Low-Noise Oscillators, PLLs and Synthesizers 

AACD 1996 Lausanne (Swiss) 

• RF CMOS Circuit Design 

• Bandpass SD and Other Converters 

• Translinear Circuits 

AACD 1995 Villach (Austria) 

• Low Noise/Power/V oltage 

• Mixed-Mode with CAD Tools 

• Voltage, Current and Time References 

AACD 1994 Eindhoven (The Netherlands) 

• Low Power, Low Voltage 

• Integrated Filters 

• Smart Power 

AACD 1993 Leuven (Belgium) 

• Mixed-Mode A/D Design 

• Sensor Interfaces 

• Communication Circuits 

AACD 1 992 Scheveningen (The Netherlands) 

• Opamps 

• ADC 




Analog CAD 






We hope that also the current book in this series will provide a valuable 
contribution to our Analog Circuit Design community. 



Johan Huijsing 




Part I: Sensor and Actuator Interface Electronics 



Sensors and actuators connect the macroscopic analog world with the computer 
through Interface Electronics. These interface circuits have the task to amplify, 
filter, multiplex, convert fi'om analog to digital and vice versa. 

The topic of interface electronics is covered by 6 tutorial papers: 

The first introductory paper firom Anton Bakker, Analog Devices Inc., San Jose, 
California, USA, presents solutions to general interface issues, such as offset and 
accuracy. 

The second paper by Andrew T.K. Tang, Analog Devices, San Jose, California, 
USA, is about low-offset OpAmps. 

The third contribution by Christoph Hagleitner, IBM Zurich, Switzerland, 
presents a wide platform for multi-sensor interfacing. 

The fourth paper by Roland Thewes, Infineon Technologies A.G., Munich, 
Germany, discusses the design of an electro-chemical interface for DNA 
detection. 

The fifth and sixth papers by Christian Enz, CSEM, Neuchatel, Switzerland, and 
Bernard E. Boser, UC Berkeley, California, USA, respectively, offer insight in 
the design and organisation of wireless sensors and their networks. 



Johan Huijsing 
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SMART SENSOR INTERFACES: AN INTRODUCTION 



Anton Bakker 
Analog Devices Inc. 
3550 North First Street 
San Jose, CA 95134 



Abstract 

This paper is an introduction to smart sensor interfaces. 
Definitions are given and some market numbers are shown. 
Most smart sensors require low-offset, low-noise amplifiers. 
Dynamic offset-cancellation techniques are the solution for that 
and principles of autozeroing and chopping are shown. 
Examples of a smart temperature sensor, flow sensor and 
magnetic sensor are shown. 



1. Introduction 

The purpose of this paper is to give an introduction to smart sensor interfaces. 
As the title is already announcing, it will not be possible to do in-depth analysis, 
but an overview will be given of the options and implementations that are 
currently possible and available. Readers are encouraged to use references to 
probe further. 



1.1 What is a smart sensor? 



Although the definition of a smart sensor may look trivial, it has not been that 
trivial in the past. Some people have called their silicon micro-machined 
acceleration sensor with integrated piezo-resistors a smart sensor, while others 
have made a general-purpose smart sensor interface. In both cases the word 
“smart” has been misused. 

In this paper we will use the following generally accepted definition of a smart 
or intelligent sensor: 



J. H. Huijsing et al. (eds.J, Analog Circuit Design, 3 - 22 , 
© 2004 Kluwer Academic Publishers. 
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A device that incorporates both a sensing element and dedicated interface 

electronics. 




Fig. 1. Smart sensor 

A graphical representation is shown in figure 1 . This does not necessarily mean 
that the sensor has to be made with IC technology or that the interface 
electronics need to be integrated on the same substrate, although that will be the 
case in most implementations. The key factor is that both the sensor and the 
electronic readout circuitry are matched and sold together. 




Fig. 2. Comparison of Smart Sensors and MEMS 
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1.2 Are MEMS and smart sensors the same? 

A term that is popping up often with smart sensors is MEMS. MEMS stands for 
Micro Electro-Mechanical System. MEMS and smart sensors have a big 
overlap, but are not the same. Please have a look at figure 2. 

Most MEMS are smart sensors like airbag sensors, gyroscopes and integrated 
microphones. However, a smart temperature sensor is not a MEMS, because it 
is not micro-machined. Micro-actuators like an optical switch (see figure 3, [1]) 
or a micro-injector are MEMS, but are no sensors. And finally pressure and 
acceleration sensors without electronics are genuine MEMS, but not smart 
sensors. 




Fig. 3. Smart optical switch (courtesy of Analog Devices) 



1.3 Why make sensors smart? 



Key words in smart sensor commercials are size, price, performance and 
reliability. Of these four reasons to make sensors smart, at least one is always 
true: Smart sensors require less space than conventional sensors. However, 
price and performance are usually exchangeable. For example, a thermistor is 
cheaper than an integrated temperature sensor, but the latter one has better 
performance, perhaps not in accuracy or sensitivity, but certainly in linearity 
and output interface. Improved reliability is certainly true for some micro- 
mechanical sensors or sensors that have gone to a different signal domain, like 
thermal wind sensors. However, one can not say in general that smart sensors 
are more reliable. Still, smart sensors can have several advantages over normal 
sensors, but it varies per application. 
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1.4 Markets for smart sensors 



Some may ask: Are there any “killer” applications for smart sensors, or is it still 
a niche market, propelled by academics? This question can be easily answered 
by a firm: Yes, there are killer applications and you are using them everyday. 
Probably the most appealing smart sensor is the airbag sensor as used in over 
90% of all new cars sold worldwide, see figure 4 [2]. 




Fig. 4. Smart capacitive acceleration sensor (Courtesy Analog Devices) 



The typical airbag sensor is a micro-machined capacitive acceleration sensor 
with readout amplifier on the same substrate. A good other example is the smart 
temperature sensor with integrated Sigma-Delta ADC and bus interface as used 
in all desktop and notebook PCs for thermal management. Both have 
accumulative sales over 200 million devices in 2002. And we won’t talk here 
about magnetic sensors, CMOS imagers, smart microphones, fingerprint 
sensors and gyroscopes which represent significant markets as well. 

However, the interesting fact is that all above mentioned smart sensors have 
developed in the last decade! They were not there ten years ago or were hardly 
sold! It can therefore easily be stated that the penetration speed of smart sensors 
is greater than micro electronics was 30 years ago. 
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2. Properties of smart sensors 

What is so special about smart sensors that we can’t use our standard 
electronics for it? To answer this question we will need to understand more 
about the special properties of smart sensors and what the consequences are for 
the interface electronics. 



2.1 Signal domains 

A commonly used definition of a sensor is: 

A device that transfers information from one signal domain into another. 

That other domain is usually the electrical domain, but that is not necessary (for 
example a liquid thermometer). However, a smart sensor always transfers 
information into the electrical domain. 

Some readers would remark that actuators (for example a loud speaker) also fit 
the above-mentioned definition of a sensor. That is definitely true and the 
collection of sensors and actuators is called transducers. However, one should 
keep in mind that a sensor is a transducer, but a transducer is not necessarily a 
sensor. 

Another question that might pop up is whether a current sensor is a sensor, 
because it does not transfer information from a signal domain into a different 
domain. The answer to this question is dependent on the implementation. If the 
current sensor is implemented by a Hall sensor sensing the magnetic field 
around a close-by conductor, then this sensor is definitely a sensor according to 
the definition mentioned above. However, if the current sensor is implemented 
by a resistor, then it does not fit the official definition. 

Until now, scientists have defined six signal domains: Electrical, Mechanical, 
Chemical, Magnetic, Thermal and Optical. The last domain is also referred to as 
Electro-Magnetic or Radial. It seems that these are all the domains, but it may 
be that we will find more domains in the future. 

Figure 2 is a graphical representation of some existing smart sensors and how 
they fit in the above described domain model. 
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Fig, 5, Smart sensor overview in signal domain model 

One of the difficulties with this simple model is the position of “hybrid” (or 
two-domain) sensors like the magnetic position sensor and the thermal flow 
sensor. For example, the position sensor could also be well placed in the 
mechanical domain, because position is primarily mechanical information. 
However, since we are focusing on the smart part of the sensor, the magnetic 
position sensor is placed in the magnetic domain, because the input domain of 
the smart sensor is magnetic. 

2.2 Compatibility with IC technology 

Zooming in on figure 5, it can be remarked that there are no killer applications 
for smart chemical sensors. This does not necessarily mean that there are no 
smart chemical sensors at all, but they are more difficult to make. This has to do 
with the fact that chemical sensors are hardly compatible with IC technology. 
Most smart sensors utilize specific properties of silicon to measure their input 
signal. For example, temperature sensors use the base-emitter voltage 
dependence of bipolar transistors, magnetic sensors use the Hall-effect, 
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gyroscopes (see figure 6 [3]) use the mechanical properties of poly-silicon and 
CMOS image sensors use the optical properties of silicon. However, some 
smart sensors add additional materials to the existing silicon IC-technology, 
usually in post-processing steps. An example of this is a smart magnetic sensor 
based on the Magneto-Resistance effect. 




Fig. 6. Smart gyroscope (Courtesy of Analog Devices) 

2.3 Electrical properties of smart sensors 

The previous paragraph showed that most smart sensors are based on specific 
properties of silicon. However, the output domain is always electrical, which 
implies that the output signal is a voltage, a current or a “modulator”. A 
modulator can be an impedance, a capacitance or an inductance and 
theoretically any other combination of these elements. For example, 
thermocouples have a voltage output, optical sensors usually have a current 
output, while mechanical sensors tend to have capacitive or resistive outputs. 
Hall sensors generate a voltage output based on a current input and are therefore 
modulators. 
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2.4 Sensor signal references 



Although smart sensors can have different kinds of output signals, not all output 
signals are equally useful for processing. Because nowadays processing is 
mainly done by digital processors, the preferred output of a smart sensor is a 
digital format.^ Because a digital format has no unit, a reference is always 
needed. 

If a sensor has a voltage output, this is usually a bandgap voltage reference. One 
of the most important features of a voltage reference is that it is stable under all 
circumstances. However, for “modulating” sensors, one tries usually to make a 
reference that is insensitive to the sensing signal, but tracks other undesired 
signals. A good example of this is the reference capacitor seen in airbag 
acceleration sensors or the reference resistor seen in piezo-resistive pressure 
sensors. 

For voltage output sensors, the reference is usually introduced just before the 
Analog-to-Digital Converter (ADC), see figure 7. 




Fig. 7. Voltage output with bandgap voltage reference 

For modulating sensors, the reference is usually introduced much earlier. A 
good example of this is the “Wheatstone” bridge as shown in figure 8 or the 
charge amplifier as shown in figure 9. Be aware that modulating sensors don’t 
need an additional voltage reference! 



’ Although “digital” is officially not part of a signal domain because it has no unit, it is usually placed in the 
electrical domain, because most digital representations are in the electrical domain. However, the signal from a 
CD or hard disc is definitely digital, and definitely not electrical. 
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Fig. 8. Smart resistive interface with reference resistor (Wheatstone 

bridge) 




Cse nse 

Cref 



Fig. 9. Smart capacitive interface with reference capacitor 

Smart sensors with a current output are a slightly different story. Ideally, one 
should make a current reference on chip, but it is much harder to make a good 
reference current than a good voltage reference, which is mainly due to the poor 
absolute accuracy and temperature stability of on-chip resistors. The solution to 
this is usually an external resistor. 

3. Smart sensor interfaces 

After this extensive introduction, we finally come to the point where this day is 
all about: Smart sensor interfaces. Why is the sensor interface such an important 
element? Because it is the component that, next to the sensor, will determine the 
overall performance of your smart sensor. And since it is usually impossible or 
expensive to improve the performance of your sensor, the smart sensor interface 
is the place where you should put your effort in. 
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3.1 Amplifiers 

As could already be seen from the previous figures, the sensor signal is usually 
too small to be directly fed into an ADC. It first needs to be amplified to have 
enough amplitude to be compared with a reference. 

What are the specifications of the amplifiers we are looking for? 

Most sensors have pretty tiny output signals. Most common temperature sensors 
have output signals in the order of 200|LiVrC[4], Hall sensors can have output 
signals less than 1|liV. Bandwidths of the sensors are usually quite limited. 
Again temperature sensors have a bandwidth that usually not exceed lOOHz, 
pressure sensors are limited to approx. IkHz, however, microphones can have a 
bandwidth up to 20kHz. Other specifications to look at are noise and the twin 
Common-Mode Rejection/Power Supply Rejection (CMRR/PSRR). Power 
consumption may be a problem in battery-operated smart sensors. 

If we compare our wish list to “general purpose” amplifiers, we can conclude 
that most significant “mismatches” are found in offset and \!f noise in CMOS 
amplifiers. The offset of a CMOS amplifier is usually in the order of typically 
ImV. This offset is mainly determined by the area of your input transistors. To 
reduce the offset, one could decide to increase the size of the input transistors, 
however, the offset only reduces by the square root of the area, so you are 
running out of juice pretty fast. Almost the same is true for the \/f noise. It can 
be reduced by spending silicon, but it only improves by the square root of the 
area. Actually, we can treat offset and \!f noise as a single phenomena, although 
their physical background is definitely different, see figure 10. 

{log[WV{Hz)I) 




Fig. 10. Noise power spectrum of a CMOS amplifier 



Why don’t we trim the offset? Of course this can be done, but that doesn’t 
reduce the \/f noise and you still have the temperature drift, which can be 1-10 

pwrc. 
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Why not use bipolar instead of CMOS? Bipolar transistors have several 
advantages over CMOS like a 3-5 times lower offset and very little Mf noise. 
However, there are a few reasons why bipolars are not commonly used for 
smart sensor interfaces. The most important one is the base current. Especially 
if you are trying to improve the noise performance by increasing the collector 
current, you are pretty fast hitting a point where the current noise and/or the 
current induced offset will become dominant. A good second reason is that the 
rest of your smart sensor, namely the ADC and probably some digital 
interfacing is made cheaper with CMOS. 



3.2 Dynamic offset-cancellation techniques 



A third good reason that I haven’t mentioned yet is the great solution that exists 
for the CMOS amplifier offset and Hf noise problem. They are generally called 
dynamic offset-cancellation techniques. They can be distinguished from “static” 
offset cancellation techniques like trimming, because they reduce offset “on the 
fly”. Dynamic offset-cancellation techniques can reduce the offset to the micro- 
volt level, while also removing the \/f noise. 

3.3 Auto-zeroing versus chopping 



There are basically two families of dynamic offset-cancellation techniques: 
Auto-zeroing and chopping[5]. Auto-zeroing is shown in figure 11 and can be 
explained as follows. 




Fig. 11. Auto-zero technique 

The offset-cancellation is done in two phases, a sampling phase cpi, and an 
amplification phase (p 2 - During (pi, the input signal is disconnected and the input 
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of amplifier is connected to ground. The offset voltage is now sampled on Caz- 
During the amplification phase, this offset is subtracted, resulting in an output 
voltage that is free from offset. A sample-and-hold circuit is added to make the 
output signal continuously available. 

The principle of the chopper technique is shown in figure 12. 




T 2T T 2T 



Fig. 12. Chopper amplifier principle 
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The input signal Fin is multiplied by a square wave signal with a frequency of 
^chop- The modulated input signal is amplified and demodulated by the same 
signal. The offset is modulated only once and appears as modulated offset 
around the odd harmonics of ^chop- The offset and signal can then be separated 
by a filter. Figure 12 also shows a time-domain explanation of the same 
principle. 

What are the pro’s and con’s of the above-mentioned techniques? The auto-zero 
technique has the advantage that it does not need a filter to remove the offset, 
which makes it better suitable to be used in opamps. The residual offset of both 
techniques is approximately the same. However, the biggest advantage of the 
chopper technique is the residual noise, which is shown in figure 13. 




Fig, 13. Noise spectrum of autozero amplifier (left) and chopper 

amplifier (right) 

The background for this difference is the fact that an auto-zeroed amplifier is a 
sampled system, while the chopper amplifier is a continuous-time system. The 
sampled auto-zero amplifier is folding back high-frequency noise to the base- 
band. In practical implementations, the noise of an auto-zeroed amplifier is two 
to three times higher than of a chopper amplifier. 

3.4 Nested-chopper technique 

The residual offset of both an auto-zero and chopper amplifier is in the order of 
tens of pV for a switching frequency in the order of lOkHz. This residual offset 
finds its origin in the charge injection of the switches. The residual offset is 
approximately linear dependent on the switching frequency for frequencies 
lower than the bandwidth of the amplifier. In most situations an offset of lOpV 
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is sufficient and already lower than the noise. However, in very low bandwidth 
applications, it can be worthwhile to further reduce the offset. This can be done 
by making a combination of dynamic-offset cancellation techniques at different 
frequencies. A good example is the nested-chopper technique as shown in 
figure 14, which has a reported offset of less than lOOnV in a bandwidth of 8 
Hz. [6] 




CLi CH^ CH2 Cl^ 



Fig. 14. Nested chopper amplifier 

3.5 ADCs 

The choice for an ADC topology that fits best for smart sensors, is not that 
difficult. Because the bandwidth is usually limited, a Sigma-Delta ADC is 
almost always suitable. Sigma-Delta ADCs have the advantage that they are 
low-cost and provide ample resolution, easily up to 16 bits. The Sigma-Delta 
sync decimation filter also fits well with the above-mentioned chopper 
techniques, because you can exactly place a notch of your sync filter on top of 
your modulated offset as shown in figure 15. 

H(f) 



frequency 




Fig. 15. Frequency response of sine filter 
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The choice between a first-order or second-order Sigma-Delta ADC is mainly 
dependent on the required resolution and the speed. For integrated temperature 
sensors one can see a shift from first-order to second-order modulators as a 
response to the need for higher resolutions. So far, I am not aware of smart 
sensor with a third-order sigma-delta modulator. 

4. Examples 

4.1 Smart temperature sensor 

One of the successful implementations of a smart sensor is a smart temperature 
sensor. The state of the art is recently published at ISSCC’03 by Pertijs et al.[7]. 
His circuit consists of a second-order Sigma-Delta modulator and various 
dynamic offset-cancellation techniques. The PTAT circuit is shown in figure 
16. 



Vdd 




Fig. 16. PTAT circuit of high-accuracy smart temperature sensor 



It won’t fit in the scope of this presentation to explain all the details of this 
circuit, but this circuit is able to do PTAT measurements with an offset less than 
IpV, which corresponds to a temperature inaccuracy of less than 0.0 l^C. A chip 
microphotograph is shown in figure 17. 








18 




Fig. 17. Chip microphotograph of high-accuracy smart temperature 
sensor (Courtesy of TU Delft) 

The chip measures 2.5mm^ and is fabricated in O.Spm CMOS. The power 
consumption is 430pW @10 samples/s. The accuracy is the best reported so far: 
0.5°C for a temperature range from -50 to 120°C with a single calibration at 
25^C. Noise is 0.03^Cm,s. 



4.2 Smart wind sensor 



Another interesting example of a smart sensor is the smart wind sensor as 
proposed by Makinwa et al. [8,9]. Although this sensor has no huge market, its 
interface is magnificent. The principle of the sensor is shown in figure 18. 




Fig. 18. Principle of smart wind sensor 
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The sensor consists of heaters and thermopiles that can measure temperature 
differences on chip. These temperature differences are a measure of the input 
flow. The challenge of the interface is the measurement of these tiny thermopile 
signals. The more accurate we can measure the sensor signals, the less power 
we have to dissipate to heat up the chip. 




Fig. 19. Principle of smart wind sensor 

The interface is shown in figure 19 and consists of three sigma-delta loops, 
which integrators both in the electrical and thermal domain. The comparators 
are autozeroed for noise and offset improvement. A chip microphotograph is 
shown in figure 20. 




Fig. 20. Chip microphotograph of smart wind sensor (Courtesy ofTU 

Delft) 
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The sensor is made in an inexpensive 1.6|Lim CMOS process and the interface is 
fiilly compatible with this process. Due to the small size of the interface, it 
perfectly fits in the center of the sensor and comes basically at zero cost. Some 
specifications: Temperature above ambient: lO^C, speed accuracy 5%, angle 
accuracy 3^, power dissipation 5mW. 



4.3 Smart magnetic sensor 



Magnetic sensors are widely used in industry, mainly for position and speed 
measurements like Anti-lock Brake Systems (ABS) and motor controls. The 
commonly used magnetic sensor for this is the Hall plate. Hall plates are very 
linear and cheap, but they are limited by huge offsets and, even worse, huge 
offset drift due to temperature and stress. A method to dramatically reduce the 
offset is the spinning current Hall plate is proposed by [10]. This method 
reduces the offset level of a Hall plate to the lOOnV level. To be able to readout 
these signals with this accuracy, a nested-chopper instrumentation is developed. 
The principle of this amplifier is shown in figure 21. 




Fig. 21. Smart spinning-current Hall sensor with nested-chopper 
instrumentation amplifier 
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The instrumentation amplifier has two chopper amplifiers and is chopped at a 
low fi’equency at the outer sides to remove charge injection by the inner 
choppers. A photomicrograph of the smart magnetic sensor is shown in figure 
22 . 




Fig. 22. Smart spinning-current Hall plate 

The instrumentation amplifier has an offset of lOOnV, which corresponds to a 
magnetic field offset of 3p,T, which is approximately 10 times smaller than the 
earth magnetic field! 



5. Conclusions 

The market for smart sensors is growing approximately four times faster than 
the overall semiconductor market and is now approximately $1.5 Billion. Smart 
sensors are usually smaller and cheaper than conventional sensors and have a 
better reliability. In many cases, also the performance can be improved. 
Dynamic offset cancellation techniques are very helpful to improve the 
performance of the readout electronics. Of the existing dynamic offset- 
cancellation techniques, the chopper technique has the lowest noise. Sigma- 
delta ADCs are in general the best choice for digitization of smart sensor 
signals. 
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Abstract 

This chapter will discuss circuit architectures for low-offset 
(<10|LiV) op-amps. The techniques used fall into two categories: 
autozeroing and chopping [1]. Although these two classes of 
techniques could be used for low-offset amplifiers in general, the 
architectures presented here have been designed to process 
continuous-time signals. The architectures have also been 
designed for use as op-amps. This means they need to be stable 
when used with different feedback networks, i.e. they need to be 
compensated. Since the feedback network is application specific 
and left to the customer, most op-amps are designed to be stable 
with the greatest amount of feedback: unity closed-loop gain. 



1. Autozeroing concept 

Autozeroing is a technique to automatically reduce the offset voltage of an 
amplifier. First, the inverting and non-inverting inputs of an amplifier are 
connected together to give a zero differential input. A correction signal is used to 
adjust the output to zero and this correction signal is stored. The input signals 
are then applied to the amplifier, which now exhibits a much lower offset. There 
are several ways to autozero an amplifier. Some techniques are simpler and less 
accurate, whereas others are more complex and more accurate. 

Figures 1 and 2 show the autozeroing concept applied to a transconductance 
amplifier, gm is the transconductance of the amplifier, ro is the output resistance 
of the amplifier, and Vos is the offset of the amplifier. Vi+ and Vj. are the input 
signals, and Vo is the output signal. To reduce the amplifier offset, a trim current 
(Itrim) is added. 
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VqS ^OS 




Figure 1: Phase 1 of Figure 2: Phase 2 of 

manual autozeroing manual autozeroing 



In phase 1 (Figure 1), the inputs of the amplifier are connected together. The 
output is given by: 

~ (^OsSm “ ^TRIM ) 1 ) 

Itrim is adjusted to be equal to Vosgm, setting Vq to 0 thereby canceling out the 
offset voltage of the amplifier. 

In phase 2 (Figure 2), the inputs of the amplifier are connected to the input 
signal, allowing the input signal to be amplified. Itrim (having the same value in 
phase 2 as in phase 1) continues to cancel out the amplifier offset. 



2. Autozeroing implementation 

Figure 3 shows the implementation of the trim current using a second 
transconductance amplifier g ^2 together with two capacitors Ci and 
C 2 . Although a single-ended implementation could be used, a fully differential 
circuit is shown here; fully differential circuits are less sensitive to errors caused 
by charge injection from MOS switches, charge leaking from the capacitors to 
the p-n junctions of the MOS switches, and ground-bounce caused by switching 
logic circuits, gmi is the transconductance amplifier from Figures 1 and 2, and ro 
is the output resistance of gmi and gm2 in parallel. Vqsi is the input offset voltage 
of gmi, and Vos 2 is the input offset voltage of gm 2 - 
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During phase 1, the switches labeled (\>l are closed and the switch labeled (j)2 is 
opened. Both inputs of gmi are connected to Vi+, and the inputs of gm 2 are 
connected to its outputs in a negative feedback configuration. The differential 
output is given by: 

(^0+ “ ^O- ) “ [^OSlSml “ (^0+ “ ^O- ^0S2 ) §m2 1 



Rearranging gives 



(^0+ ^0-)(^“*"§m2^0 )“ ^ot^OSlSml ^OS2§m2] 



(3) 



(Vo,-VoJ = 



(^OSl§ml ^OS2§m2 ) 

l + gm2ro 



(4) 



Since »1, 



Sm2 



(5) 



VcH and Vq- are also the voltages used to generate the trim current; they are 
stored on memory capacitors Ci and C 2 respectively. 
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gm2 is usually made smaller than g^i for several reasons: 

1 . The voltage stored on Ci and C2 is less sensitive to Vos2- 

2 . The corrected offset will be less sensitive to errors introduced when the 
switches connected to Ci and C2 are opened. 

3 . The noise introduced by gm2 will be insignificant compared with the noise 
from gn,i. 

Since the input-referred offset of the autozeroed amplifier can be calculated 
using: 



^ _ (output offset) 

(amplifier gain ) 

the autozeroed offset during phase 1 becomes: 



V. 



(V0.-V0-) 

gmlfo 



( 6 ) 



( 7 ) 



V V 

V _ ''OSl ^OS2 

’OSAZI 

§m2^0 SmTo 

By making ro sufficiently large (e.g. using a folded cascode structure), V 
can be reduced to a few microvolts. 



( 8 ) 

OSAZl 



In phase 2 , the switch labeled (|)2 is closed and the switches labeled (|)1 are 
opened. Opening the switches between the inputs and outputs of gm2 allows 
capacitors C\ and C2 to continue to set gm2's inputs to null out g^i's offset. 
Meanwhile, the inverting input of gmi is connected to Vi_, allowing g^i to 
amplify the input signal. 

Unfortunately, the differential input of gm2 will change slightly during phase 2 . 
There are two reasons. 

1 . Charge injection from the switches across gm2- 

Charge is injected from the MOS switches onto capacitors Ci and C2 when the 
switches are opened [ 2 , 3 ]. This causes changes in the voltages stored at Ci and 
C2. This error can be minimized by making Ci and C2 large and the MOS 
switches small. Making gm2 differential (as illustrated in Figure 3 ) helps because 
the charge injection is similar at Ci and C2, allowing the most of the charge 
injection error to be cancelled by the common-mode rejection of gm2 (assuming 
the voltage across Ci is similar to the voltage across C2). If gm2 is single-ended or 
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if there is a significant differential voltage at Ci and C 2 then dummy switches [4] 
can be used to reduce the charge injection error. 

2. Leakage currents from the switches across gm 2 - 

MOSFETs have reverse-biased p-n junctions at the drain and source terminals. 
MOS switches will therefore draw current from Ci and C 2 , causing the stored 
voltage to change (especially at high temperatures). Again, making gm 2 
differential helps to reduce this error because the discharge of Ci and C 2 will be 
similar (assuming C\ and C 2 are matched and the MOS switches are matched). 
By comparison, a single-ended implementation of gm 2 will be far more sensitive 
to MOS switch leakage. 



If the total error introduced at the input of gm 2 is Va, the input-referred offset of 
the autozeroed amplifier will change by: 



V = V 

^'OSA ''a 



vgml y 



(9) 



Adding this error term to equation (8) gives the input-referred offset during 
phase 2 as: 



V. 



^OSl ^OS2 I ^Agm2 

gm2^0 gml^O gml 



(10) 



Re-writing equation (10) will give a clearer picture of the trade-off between g^i 
and gm 2 : 






Ay 









^OS2 _|_ ^ABm2 
gml^O gml 



( 11 ) 



Vosi is the dominant contributor to Vqsaz if gmi/gm2 is too large. Va is the 
dominant contributor to Vqsaz if gmi/gm2 is too small. If Vqsi and Va are both 
known than an optimum value of gmi/gm2 can be found to minimize Vqsaz- 

Note that by connecting both inputs of gmi to Vi+ during phase 1, the offset of 
gmi is autozeroed at the input signal voltage. This means that the offset voltage 
after autozeroing is independent of the input signal voltage, giving g^i a DC 
common-mode rejection much higher than normal (>120dB). If g^i were 
connected to a fixed voltage Va during autozeroing then gmi’s limited common- 
mode rejection would cause the offset to change when the inputs of gmi are 
disconnected from Va and connected to Vi+ and Vi_. 
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One weakness of the amplifier in Figure 3 is that it cannot be used to amplify a 
signal continuously because it needs to be periodically autozeroed to refresh the 
voltages at Ci and C 2 . The feedforward autozero architecture overcomes this 
limitation. 



3. Feedforward Autozero Architecture 

The feedfoward autozero architecture was developed to allow an autozeroed op- 
amp to continuously amplify a signal. Early implementations such as [5] and [6] 
needed external memory capacitors (0.1|LlF to 10|LtF) and switched at low 
frequencies (below 400Hz). The long time constants associated with using large 
memory capacitors resulted in a long recovery time after an op-amp was 
overloaded (output driven to a supply rail). Nevertheless, the maximum input- 
offset was guaranteed at 8|LiV. Later versions such as [7] integrated the memory 
capacitors onto the silicon chip, switched at higher frequencies (e.g. 2.5kHz), 
and had shorter overload-recovery times (e.g. Sms). This made them much easier 
to use since they appeared much more like a normal op-amp. One disadvantage, 
though, is that the noise from DC to IkHz was large at 90nV/VHz. 

Figure 4 shows a feedforward autozero topology using internal memory 
capacitors; the output resistance of the amplifiers have been omitted for 
simplicity. The switches labeled (|)1 are closed during phase 1 and open during 
phase 2; the switches labeled (|)2 are closed during phase 2 and open during 
phase 1. 
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Figure 4: Feedforward autozero architecture 

gmi, gm2. Cl and C2 are the same as Figure 3: gmi is autozeroed during phase 1. 
The additional components (gm3, gm4, gms, C3 and C4) allow the input signal to be 
amplified whilst gmi is being autozeroed. 

One way to think of this circuit is to treat gmi, gm 3 and gms as a track-and-hold 
circuit with gain: the input signal is tracked and amplified during phase 2, and is 
held on memory capacitors C3 and C4 during phase 1 . The input offset of this 
signal path is low since gmi (the first gain stage) is autozeroed. If the voltage 
gain of gmi (combined with it's output resistance roi) is sufficiently high, then the 
offset voltages of gm 3 and gm 5 have little influence on the offset voltage of the 
complete amplifier. 

When the amplified signal is being held on C3 and C4 during phase 1, gm4 and 
gm 5 provide an alternative path for the input signal to be amplified. gm 3 maintains 
the low offset during phase 1, as long as the input signal does not move 
significantly. 

Note that external feedback is needed for this architecture to work as a low 
distortion amplifier. Without feedback, switching between three gain stages 
during phase 2 and two gain stages during phase 1 will give a lot of distortion. 
Distortion will be discussed in greater detail later. 
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3.1 Compensation 

When the amplifier is used as an op-amp, it needs to be stable when external 
feedback is applied. Figure 5 shows the compensation of the amplifier using two 
capacitors Cci and Cci- During phase 1, when the signal path has two gain 
stages, one compensation capacitor (Cci) is used. During phase 2, when the 
signal path has three gain stages, two compensation capacitors (Cci and Cc2) are 
used. 




Figure 5: Compensating the feedforward autozero architecture 

The interesting point about Cc 2 is that it is used for compensation only during 
phase 2; Cc 2 is connected between the inverting output of gmi and the output of 
gm 5 during this phase. If Cc 2 remained connected between g^i and gm 5 during 
phase 1, however, it would disturb the autozeroing of gmi by gm 2 whenever Vq 
changed. To prevent this disturbance, Cc 2 is disconnected from g^i and 
connected to a voltage Vi during phase 1; Vi should have the same voltage as 
the inverting output of gmi. That way, Cc 2 continues to be charged by changes in 
Vo, minimizing output glitches when it is re-connected to gmi at the beginning of 
phase 2. 
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3.2 Intermodulation Distortion 

One may suspect that switching between two gain stages during phase 1 and 
three gain stages during phase 2 might lead to severe distortion. Indeed, 
intermodulation distortion (IMD) would result if the amplifier had one gain 
during phase 1 and a different gain during phase 2 because the gain applied to 
the input signal would be modulated by the switching frequency. 

Figures 6 and 7 show two different situations for the op-amp open-loop gain 
during phases 1 and 2. In both cases, fs is the signal frequency (assumed to be 
greater than the pole frequency during phase 1), AoLi(fs) is the open-loop gain at 
the signal frequency during phase 1, and AoL 2 (fs) is the open-loop gain at the 
signal frequency during phase 2. If the gain-bandwidth product (GBW) was 
different between phase 1 and phase 2 (Figure 6), AoLi(fs) would be different 
from AoL 2 (fs)> and the closed-loop gain would switch between two values, 
resulting in IMD. If, on the other hand, the GBW were the same in both phases 
(Figure 7), then AoLi(fs) would be the same as AoL 2 (fs), and the closed-loop gain 
would be constant regardless of whether the circuit is operating in phase 1 or 
phase 2. Of course, mismatch of the gain-bandwidth products will leave residual 
IMD products. For signals with frequencies lower than the pole frequeney 
during phase 1, the large open-loop gain will heavily attenuate any distortion so 
the IMD will be small. 





Figure 6: Figure 7: 



GBW^, <> GBW ^2 =>IMD 



GBW^i = GBW ^2 ^ no IMD 
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Figures 8 and 9 show the residual IMD of a feedforward autozeroed op-amp 
with a switching frequency of 7.5kHz; switching transients can be seen at 
7.5kHz, 15kHz, and 30kHz. In Figure 8, a IkHz signal gives IMD products at 
6.5kHz and 8.5kHz, with amplitudes 84dB and 83dB below the signal 
respectively. In Figure 9, a 2kHz signal gives IMD products at 5.5kHz and 
9.5kHz, with amplitudes 74dB and 76dB below the signal respectively. 




Figure 8: DC to 50kHz spectrum Figure 9: DC to 50kHz spectrum 
showing IMD with IkHz signal showing IMD with 2kHz signal 
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4. Ping-pong autozero architecture 

The ping-pong autozero architecture [8] is another way to provide a continuous 
signal path using autozeroing. The concept is shown in Figure 10. 

During phase 1, gmi is autozeroed using g^i- Meanwhile, the input signal is 
amplified using gms and gms- During phase 2, g^s is autozeroed using gm4- 
Meanwhile, the input signal is amplified using gmi and gms- The architecture is 
easier to design and lay out than the feedforward autozeroed architecture 
because gmi and gm 3 are identical, and gm 2 and g „4 are identical. The gain- 
bandwidth product is the same for both phase 1 and phase 2, so IMD will result 
only from mismatches between gmi and gm 3 - 




Figure 10: Ping-pong autozero architecture 
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5. Chopping concept 

Chopping [9] is another way to obtain very low offset amplifiers. The idea is to 
use one multiplier at the inputs and one multiplier at the output of an amplifier. 
Consider an amplifier with a gain of A and an input offset of Vos- In phase 1, 
both multipliers have a gain of +1 (see Figure 11). The effective input offset 
during this phase is +Vos- In phase 2, both multipliers have a gain of -1 (see 
Figure 12). The effective input offset during this phase is -Vqs- If both phases 
have a 50% duty cycle then the average offset over both phases becomes 0. 



Vi+ o- 

V,- 



Vos 

_i + 



fViN 



+ fVlN fVlN + ypS 




A(Vin+Vos) - 




A(Vin+Vos) 



Figure 11: Chopping, phase 1; input offset voltage is +Fos 




A(-Vin+Vos) 




Figure 12: Chopping, phase 2; input offset voltage is - Vqs 

Since the input offset is +Vos on phase 1 and -Vos on phase 2, chopping 
introduces a square wave with amplitude 2Vos peak-to-peak at the input. This 
square wave is then amplified by the amplifier. Another way to describe this 
effect is that the ±1 multiplication is a modulation process. The DC offset and 
1/f noise from a normal amplifier (shown in Figure 13) is modulated only once 
(by the output multiplier), so the offset and 1/f noise is modulated from DC to 
the chopping frequency (see Figure 14). The input signal, on the other hand, is 
modulated by the input multiplier and demodulated again by the output 
multiplier, so its spectrum (apart from the amplifier gain) remains unchanged. 
This assumes the amplifier has enough bandwidth to pass all the harmonics of 
the modulated signal. 
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Figure 13: Noise from op-amp Figure 14: Noise from op-amp 

without autozeroing nor chopping using chopping 

Of course, a large square wave from the amplified, modulated offset is 
undesirable at the output so chopping is usually done at a frequency much higher 
than the signal frequency, and the chopped output is low-pass filtered. 



6. Chopping implementation 

Chopping can be implemented very easily if the amplifier has differential inputs 
and outputs (see Figure 15). A gain of -1 at the amplifier input during phase 2 is 
implemented using switches to invert the input signal. The -1 gain at the 
amplifier output is implemented in the same way. A gain of +1 at the amplifier 
input during phase 1 is implemented using switches to connect the amplifier to 
the input signal without inversion. 




Figure 15: Simple implementation of chopping 

When the circuit in Figure 15 is used to drive resistive loads, there will be an 
instant when all the switches are open, i.e. the load is not driven. This results in 
large output glitches. To reduce these switching glitches, a second gain stage can 
be used as an output buffer [10], as shown in Figure 16. If external feedback is 
used then a compensation capacitor Cc is also needed, changing the output 
buffer into a first-order low pass filter. This will help to attenuate the modulated 
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offset, although the modulated offset will be too large for most applications 
unless the chopping frequency is several orders of magnitude higher than the 
filter bandwidth, or additional filtering is used. 




Figure 16: Chopped amplifier with output buffer 



High-order filters would remove the modulated offset more efficiently than a 
first-order filter, but the resulting op-amp will be more difficult to stabilize when 
used with an external feedback network. Nevertheless, one family of op-amps 
has been released using chopping together with 4th-order filtering [11, 12]. The 
maximum input offset is 10|a.V but the most impressive parameter is the input 
noise: 6nV/VHz from IHz to 2kHz (although 2mA of supply current is needed). 



7. Comparison of autozeroing and chopping 

Comparing Figures 5, 10, and 16, it can be seen that using autozeroing to 
continuously amplify signals requires more circuitry than using chopping. The 
larger circuitry and storage capacitors for autozeroing take more die area. 

Another weakness of autozeroing is that wideband noise is aliased when the 
offset is sampled onto the memory capacitors. This results in low-frequency 
noise typically two or more times higher than the thermal level. Figure 17 shows 
the noise power spectral density (PSD) of a normal op-amp using neither 
autozeroing nor chopping. There is offset and 1/f noise at low frequencies, and 
thermal noise at high frequencies. Figure 18 shows the noise PSD for an 
autozeroed op-amp. The noise is greater at low frequencies due to the aliasing of 
wideband noise. 

By comparison, the low-frequency noise for chopped amplifiers (Figure 19) is 
near the thermal level. However, DC offset and 1/f noise is modulated to the 
chopping frequency. 
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op-amp without op-amp using op-amp using chopping 

autozeroing nor autozeroing 

chopping 

Most commercial sub-10|xV offset op-amps use autozeroing rather than 
chopping because autozeroing produces much smaller artifacts at the switching 
frequency compared with chopping. The smaller artifacts is important because 
the manufacturers have chosen to use switching frequencies between 2kHz and 
15kHz (a little higher than most signal bandwidths) to keep the input bias 
current (due to charge injection from the input switches) low. 

If the switching frequency can be made several orders of magnitude higher than 
the signal frequency, or if additional filtering can be used, then chopping may be 
more attractive than autozeroing: the modulated offset can be filtered easily with 
simple filters, the die area is smaller, and less power is needed for a given noise 
level. 



8, Chopped Autozero Architecture 

An op-amp recently introduced uses both chopping and autozeroing [13, 14]. 
The goal was to obtain the low-frequency noise of a chopped op-amp without 
the modulated offset at the chopping frequency (see Figure 20). To combine 
autozeroing and chopping, a ping-pong autozeroed op-amp (Figure 10) was used 
as a starting point. For each of the two autozeroing phases, the signal path is 
chopped once, giving four phases in total. The resulting circuit is shown in 
Figure 21. The circuit complexity is slightly greater than the ping-pong 
autozeroed op-amp, but the noise is significantly lower at low frequencies (the 
frequency of interest for most low-offset applications). The operation of each 
phase is shown in Table 1. 
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Figure 20: Noise spectrum from op-amp 
using both chopping and autozeroing 




Figure 21: Op-amp architecture using both chopping and autozeroing 
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Phase 1: 


gm, autozeroed using g^a 

Input signal amplified using gm 3 and gms 

without inversion at gn ,3 


Phase 2: 


Input signal amplified using gms and gms 
with inversion at gm 3 


Phase 3: 


gm 3 autozeroed using g ^4 

Input signal amplified using gmi and gms 

without inversion at gmi 


Phase 4: 


Input signal amplified using g^i and gms 
with inversion at gmi 



Table 1: Operation of chopped autozeroed op-amp shown in Figure 21 

Figure 22 shows the wideband noise spectrum of the op-amp, demonstrating the 
same behavior as Figure 20. The autozero frequency is 7.5kHz, and the 
chopping frequency is 15kHz. Figure 23 shows the low-frequency noise 
spectrum: the noise at IkHz is about 20nV/VHz, which compares well against 
autozeroed op-amps which typically exhibit 70nV/VHz or more. The maximum 
offset is specified at 10|iV. 




Figure 22: DC to lOOkHz 
noise spectrum 



Figure 23: DC to 20kHz 
noise spectrum 
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10. Summary 

The use of autozeroing or chopping or both have allowed op-amps to be 
manufactured with offsets lower than 10|xV. Since the offset is continuously 
corrected, the low offset is maintained over time, over temperature, and over 
input signal voltage. This translates to low drift and a high common-mode 
rejection ratio. 

The strength of autozeroed op-amps is that little energy is introduced at the 
switching frequency, allowing them to be used with signals with wider 
bandwidth than the autozero frequency. For this reason, most sub-10|LiV offset 
op-amps are autozeroed. The weakness of autozeroing is that noise is aliased 
into the low-frequency band when the offset is sampled. Another weakness is 
that additional circuitry is needed to process signals continuously since the first 
gain stage needs to be removed from the signal path periodically for autozeroing. 
This additional circuitry, together with the memory capacitors needed to store 
the correction signal, generally gives a die area larger than a chopped amplifier. 

The strength of chopping is that the low frequency noise is near the thermal 
noise level. The weakness is that the pre-chopped offset is modulated to the 
chopping frequency; this modulated offset can be difficult to remove. One way 
to remove the modulated offset is to chop at a frequency several orders of 
magnitude higher than the signal bandwidth, then use simple (first or second 
order) filtering. The disadvantage of using a very high chopping frequency is 
that both the offset and input bias current will increase. An alternative way to 
remove the modulated offset is to use a high order filter. However, an op-amp 
using high-order filtering can be more difficult to stabilize when used with 
external feedback. In both cases, additional components are needed to 
implement filtering. 

Combining autozeroing with chopping gives the benefits of low noise density at 
low frequencies, and low energy at the switching frequencies; the circuit 
complexity is only slightly greater than an autozeroed op-amp. 
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Abstract 

The research on large sensor networks along with the trend to 
miniaturization and the realization of an increasing number of 
components on a single chip (system on chip, SOC) create a 
strong demand for systems that combine multiple sensors or 
sensor-arrays together with their assoeiated circuitry and 
interface units in a single-chip solution. In this article, the 
requirements to realize such multi-sensor interface chips are 
discussed. The main focus is on the analog frontend and the 
system architecture of the chip. 



1. Introduction 

The growth of information technology (IT) over the past years has created a 
large demand for all kinds of sensors and actuators that enable automated 
interaction of machines with their environment or human beings. Sensor 
networks and smart sensors for environmental investigation can be found 
among the top priorities in the research programs of the European Union [1] 
and the USA. Environmental investigation of, e.g., air quality requires 
simultaneous measurements of several important parameters and gases 
(temperature, humidity, carbon monoxide, nitrogen oxides NOx). These 
multiple sensors should be combined in a single device or even in a single 
chip to achieve a miniaturized, autonomous, and cost-effective solution. In 
this article multi-sensor interfaces are discussed, which are necessary to 
combine several different sensors on a single chip. Another application of 
multi-sensor interfaces includes large arrays of identical sensors, which are 
realized in well-known imaging devices such as CMOS cameras and 
fingerprint sensors. Large sensor arrays are also developed in 
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biotechnological research (e.g., neural interfaces or large assays for drug 
screening). 

In Section 2 the requirements and design choices for the interfaces of the 
chip towards the sensors and towards the signal-processing circuitry are 
described. Then the system architecture of an interface chip is outlined and 
the requirements for the various building blocks are discussed. In Section 3, 
different circuit implementations of the key building blocks will be given. 
Section 4 contains the conclusions. 

2. Chip Architecture for Multi-Sensor Interfaces 

2.1. Sensor Interface and Fabrication Technology 

An early decision to be made in the process of designing a multi-sensor 
system is the choice between a monolithic solution, in which the sensors and 
the interface circuitry are both fabricated on the same substrate, and a hybrid 
solution, in which the sensors and circuitry are fabricated on separate 
substrates and often use different technologies. The final choice is a trade- 
off between the better performance and higher flexibiUty of the hybrid 
solution and the low complexity and low cost of the monolithic solution. For 
single-sensor systems there are commercial examples of monolithic [2, 3] as 
well as hybrid solutions [4, 5]. A detailed discussion can be found in [6]. 

Hybrid solutions are used when it is difficult or impossible to merge the 
technologies for sensor fabrication and interface circuitry (e.g., because of 
high-temperature annealing steps for the sensor structures, which would 
destroy the CMOS metallization layers of the interface chip). They are 
difficult to realize for multi-sensor systems because they result in a large 
number of input/output (10) connections between the interface chip and the 
sensor devices. Wirebonds are limited to the periphery of the chips and lead 
to large parasitic capacitances and inductances. Therefore, advanced area 
array packaging technology, such as Flip-Chip [7, 8], has to be employed, 
which uses the entire area of the interface chip for interconnects and allows 
the stacking of devices to reduce the area and parasitics. 
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Fig. 1: PbSn bumps for flip-chip packaging after reflow 
(pitch 50 pm, diameter 25pm); from [9], © 2000 SPIE 

Fig. 1 shows a chip with solder bumps at a pitch of 50 pm that was 
developed for a pixel detector. Recently, a device-transfer technology has 
been introduced by Despont et al. at IBM [10] that allows a further increase 
of the interconnect density. Fig. 2 shows an array of micromachined 
cantilevers for probe-storage applications [11] that has been transferred onto 
a wiring wafer. 




Fig. 2: Scanning electron micrograph of a section (and a close-up view) of a 
cantilever array transferred onto a wiring wafer using device-transfer 
technology; from[10], ©2003 IEEE. 

Monolithic solutions use the wiring levels of the interface chip to connect 
the sensor device and the interface circuitry. The width and spacing of the 
wiring levels is on the order of 1 pm, which allows the highest interconnect 
density. Furthermore, crosstalk, parasitic capacitance, and parasitic 
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inductance are minimized. Monolithic solutions for multi-sensor systems 
include CMOS imagers, fingerprint sensors and integrated gas-sensors. 

There is no general rule for the selection of the fabrication technology for 
multi-sensor interface chips, but some guidelines can be given: 

• CMOS vs. BiCMOS, bipolar and dedicated sensor processes: There 
are only very few applications that benefit sufficiently from the 
availability of bipolar transistors to justify the larger cost entailed. 
Furthermore, a large number of CMOS processes is available through 
inexpensive multi-project wafer (MPW) prototyping services such as 
Europractice or MOSIS [12, 13]. 

• Feature size: Multi-sensor interfaces lag the scaling trend of digital 
CMOS because of their high content of analog circuitry and the large 
size of many sensor devices. Furthermore, the low supply voltages of 
advanced CMOS technologies often are not sufficient to bias the 
sensor devices. The only exceptions to this rule are systems that 
include a large amount of data-processing on-chip (e.g., imagers with 
on-chip filtering). 

• Prototyping services: For many multi-sensor applications, MPW 
services can only be used at an early stage of development. Later on, 
complete wafers are required because (a) the packaging is done on the 
wafer level or (b) sensor fabrication for monolithic solutions is 
performed on the wafer level. In this case, established CMOS 
technologies that, compared with the latest technology, involve low 
mask costs are preferred. 

• Analog options: Many sensor interfaces require analog options such 
as high-voltage lOs, high-resistivity layers, and linear capacitors. 
These analog options often are not yet available for the latest digital 
CMOS technology. 

As a result, the majority of interface chips currently uses established 
standard CMOS technology (0.25 pm - 1 pm) enhanced by selected analog 
options (high-voltage 10, linear capacitors) and postprocessing for 
monolithic solutions [6]. 

2.2. A Template for Multi-Sensor Interface Chips 

Fig. 3 shows a block diagram of a typical multi-sensor interface chip. The 
small signals from the sensors are first amplified and filtered by the analog 
frontend. After analog-to-digital conversion, the digital signals are 
transmitted to an external data-processing unit. 
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F/g. 3: Template for a multi-sensor interface chip 



Owing to the large number of sensors, several auxiliary building blocks 
have to be included (power management, analog/digital test circuitry, test 
interface, biasing). In the template in Fig. 3, two separate interfaces are 
used: A slow serial interface is used to set parameters, perform calibration 
functions, and receive/transmit status information, whereas a fast dedicated 
interface is used to transmit the measured data. If the data rate is sufficiently 
low, the two interfaces can be merged. 

The large number of devices leads to specific requirements for the 
architecture of multi-sensor interface chips: 

• Minimum number of lO connections: Many sensor applications 
require custom packages because the sensor needs a “window to the 
outside world” so that the sensor has access to the quantity to be 
measured (e.g., analyte-loaded air for gas sensors). Furthermore, the 
choice of packaging material is often restricted by the sensor 
application (e.g., to avoid outgassing). This increases the cost of lO 
connections compared with standard products and, therefore, the 
number of lOs must be minimized. A review of microsensor 
packaging can be found in [14]. 

• Suppression of noise and crosstalk in the analog frontend: The 
combination of analog and digital circuitry as well as a large number 
of simultaneously active sensors lead to crosstalk and large substrate 
noise. 
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• Design for testability: In many cases, the analog inputs of the 
interface chip are not accessible through external lO connections. 
This makes it difficult to test the interface circuitry. 

• Self-test and calibration: The sensor interface must include circuitry 
that allows the calibration of each individual sensor. 

In the following, these issues are discussed, and some guidelines for the 
system architecture of multi-sensor interface chips are derived. 



2.3 Sensor Arrays vs. Combination of Different Sensors 

Multi-sensor systems can be divided into two categories: 

• Arrays of identical transducers: The main architectural choice for 
large sensor arrays is between cell-level implementations, where each 
sensor element has its own dedicated circuitry, and block-level 
implementations, where each cell has only minimal functionality and 
the major building blocks are multiplexed between the sensor 
elements. Furthermore, the amount of circuitry can be significantly 
reduced by choosing an architecture that allows intelligent sharing of 
resources. 

• Different sensors combined on a single chip: Sharing of resources 
is difficult for systems containing several different sensors. Therefore, 
the main focus is on power optimization, area minimization, crosstalk 
avoidance, and noise immunity of the individual sensors. The use of 
separate interfaces also facilitates the power management of the chip. 

2.4 Data Processing 

Today, most multi-sensor systems perform the processing of the data on a 
separate system (microprocessor, DSP) because the algorithms to perform 
the preconditioning of the data (normalization, transformation) and the 
classification (such as qualitative vs. quantitative analysis) vary from 
application to application. Furthermore, sensor devices are often fabricated 
using a less advanced technology than that of the latest generation of 
microprocessor and DSPs. 

For large sensor arrays, research has already gone one step further, and data 
processing is partially performed on the sensor chip. This reduces the 
amount of data that has to be transferred off-chip. Furthermore, certain 
operations are performed more efficiently in parallel on a small subset of 
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elements than by an external DSP (e.g., filters to extract features such as 
movement or edges that involve only a small number of neighboring 
elements). 



2.5 Substrate Noise and Crosstalk 

Most multi-sensor interface chips require on-chip digital circuitry to transfer 
data, set parameters, perform calibration, and execute power-management 
commands. This causes substrate noise, which affects the weak signals at 
the sensor interface. Furthermore, crosstalk from other sensors (uncorrelated 
in the case of multiple different sensors on a single chip) also influences the 
sensor via substrate or power-supply lines. This translates into strict 
requirements for the rejection of power-supply and common-mode signals 
(PSSR, CMRR). These requirements are best met by fully differential 
sensor frontends. If possible, the sensor should already provide a differential 
input signal, which is then processed by fully differential analog frontend 
circuitry. Fully differential amplifiers for analog frontends are presented in 
Section 3.1. 



2.6 Power Consumption 

Many multi-sensor systems are designed for use in mobile devices and 
hence must be suitable for low-power operation from an unstable battery 
supply. There are two key elements to meet the resulting power 
specifications: 

1. On-chip power management: This is especially important for 
systems with many different sensors because the individual sensors 
often have large differences in their time constants. By providing the 
possibility to operate each sensor only for the amount of time required 
to perform a measurement, the overall power consumption can be 
optimized. For sensor arrays, for example in imaging applications, 
savings in power can be achieved by operating the system at a lower 
resolution and by switching off the elements not in use. 

2. Sharing of resources: For arrays of identical sensors, the largest 
savings in power can be realized by choosing an architecture that 
shares the power-hungry elements between all sensor cells and uses 
only a minimum number of building blocks on a per-cell basis. 
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2.7 Test of Multi-Sensor Systems 

There are two types of tests for mixed-signal ICs: 

• Functional testing: Detailed characterization of a few fabricated 
chips to verify that the performance of every single building block on 
the chip is in accordance with the specifications and simulations. 

• Production test: This type of test assumes that all building blocks 
have already been verified. The goal of the production test is to 
identify defective chips in a production lot (shorts, open connections, 
excessive contact resistance). 

It is important that testing issues are considered at an early stage of the 
design of multi-sensor systems (design for testability, [15]) and that a few 
basic mles are observed: 

• The analog inputs of the interface circuitry often are not longer 
accessible once the sensor has been connected. A multiplexer at the 
input is required to provide access for external test signals. 

• The outputs of intermediate building blocks (e.g., input amplifier, 
filter) should be multiplexed to analog outputs. A buffer amplifier 
with appropriate swing and bandwidth must be used to avoid large 
loads and noise coupling to sensitive nodes. 

• The testing of sensors with real-world stimuli is sometimes difficult 
and time-consuming. If possible, test circuitry that can simulate 
sensor signals should be included. In the case of a gas sensor, which 
measures the heat generation during the absorption of an analyte 
(enthalpy change), a heating element can be included to generate a 
known amount of heat. 

• Typical testing time for the production test of a mixed-signal chip is 
on the order of a few seconds. In many cases, serial testing of the 
individual interfaces on a multi-sensor interface chip consumes too 
much time. An on-chip digital controller that performs the tests in 
parallel and transmits the results through a dedicated test interface 
(which can also be used for calibration) solves this problem. 

2.8 Application Example: CMOS Gas Detection System 

Fig. 4 shows the block diagram of a monolithic multisensor chip for gas 
detection [16, 17]. It comprises three different transducers that respond to 
fundamentally different properties of the analyte-loaded air. In combination 
with an on-chip temperature sensor and off-chip data processing, gas 
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mixtures can be classified (qualitative analysis with pattern-detection 
algorithms) or quantified using this chip. 
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Fig. 4: Block diagram of CMOS gas detection system 



The first transducer is a micromachined cantilever oscillating at its 
resonance frequency. The absorption of analyte in a chemically sensitive 
polymer on the surface of the cantilever causes shifts in the resonance 
frequency. The second transducer is a planar capacitor with polymer-coated 
interdigitated electrodes to monitor changes of the polymer dielectric 
constant upon analyte absorption. The third transducer is a thermoelectric 
calorimeter (Seebeck effect) that detects enthalpy changes during absorption 
or desorption (transient signal). 

The temperature sensor is needed to take into account the strong 
temperature dependence of the analyte absorption. Some of the key building 
blocks (first amplification stage, ADC) of the three transducers are detailed 
in Section 3. A chip photograph is shown in Fig. 5. 



Digital interface: 

The data rate for chemical sensors is low (< 1 kSamples/s). Therefore, the 
two separate interfaces for data-transmission and calibration (see Fig. 3) 
have been merged. A custom I2C interface is used to transmit the data 
(master mode instead of polling to reduce bus load). 
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Fig. 5: Chip photograph of CMOS single-chip gas-detection system 



Sensor Interface and Fabrication Technology: 

The chip was fabricated in a standard 0.8-pm CMOS technology with two 
levels of metal and linear poly/poly capacitors. The sensors were fabricated 
by using some of the intrinsic processing steps of the CMOS technology and 
CMOS-compatible postprocessing. Because of the monolithic integration of 
the sensors, it was possible to use the wiring levels of the CMOS process to 
connect the sensors to the interface circuitry. Flip-chip packaging onto a 
ceramic substrate was used as a first-level packaging. The ceramic substrate 
is needed to prevent outgassing of standard packaging materials, which 
would create interference with the sensor signal. A hole in the ceramic 
substrate allows the sensors to access the analyte-loaded air while the rest of 
the circuitry is protected by a sealing ring and the flip-chip underfill (Fig. 6). 




Fig. 6: Schematic drawing of the flip-chip packaging approach 
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Test of the single-chip gas-detection system : 

Fig. 7 shows a schematic drawing of the test setup for the single-chip gas- 
detection system. The digital part of the chip (controller, digital interface) is 
tested using automatically generated test patterns for the scan-chain 
architecture. 
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Fig. 7: Setup for the test of the single-chip gas-detection system 



The analog circuitry and the sensor elements must be tested through the 
digital interface because the flip-chip packaging approach does not allow 
additional test pads. The testing time is reduced by employing the scan 
chain to load a specific state that is needed for a certain test. Next, the chip 
is switched from scan mode to normal operation until the analog circuitry 
has settled to the final value. Finally, the results, which are stored in the 
output registers of the ADC, and some dedicated test registers are read out 
via the scan chain again. This method allows a rapid setting of the 
parameters as well as parallel testing of all sensor elements and their 
interface circuitry (details can be found in [16]). 



3. Building Blocks for Multi-Sensor Interfaces 

In the following, some of the key building blocks for multi-sensor interfaces 
will be described. After an overview of fully differential frontend 
amplifiers, the analog-to-digital converter (ADC) and the digital interfaces 
will be discussed. 



3.1 Amplifiers 

The first amplification stage determines the overall performance of the 
system with respect to signal-to-noise ratio (SNR), power-supply rejection 
ratio (SNRR), linearity, and often also power consumption. A fully 
differential design is desirable for the amplification of small signals in 
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multi-sensor interfaces because it offers superior rejection of substrate and 
power-supply noise. 



3.1.1 Instrumentation Amplifier 




Fig. 8: Instrumentation amplifier, gain Ay=l +2RJRh 

Fig. 8 shows the classic instrumentation amplifier. It can easily be designed 
for rail-to-rail output swing and, with a combined PMOS/NMOS input 
stage, also rail-to-rail input swing can be achieved. The disadvantages of 
this configuration are that it requires two operational amplifiers (increased 
power consumption) and that it is only pseudo-differential (transparent to 
common-mode signals). 



3.1.2 Fully Differential Inverting Operational Amplifier (OpAmp) 



Ra 




Fig. 9: Fully dijferential inverting operational amplifier 

Fig. 9 shows a fully differential version of the standard inverting amplifier. 
The gain is given by Av = Ra/Rb and the common-mode voltage at the output 
is defined by an additional feedback loop inside the amplifier. The 
disadvantage of this amplifier is the resistive load, which has to be driven by 
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the sensor (Rb) and the second amplification stage of the OpAmp (Ra). 
Furthermore, additional power is needed for the second differential input 
stage, which is needed for the common-mode feedback. 

The fully differential OpAmp is often used in switched capacitor interfaces, 
where the feedback resistors are replaced by capacitors (e.g., the switched 
capacitor ZA-modulator in Fig. 10). Fig. 10 shows a circuit implementation 
of this amplifier, and summarizes its performance. 




Fig. 10: Schematic of the operational transconductance amplifier for a fully 
differential switched-capacitor amplifier 

A single-stage folded-cascode amplifier provides high DC gain and is able 
to drive the capacitive load. The second differential input stage can be 
avoided by using switched-capacitor techniques for the common-mode 
feedback (CMFB). The NMOS current sources at the bottom of the 
amplifier are split to allow independent optimization of the noise 
performance and the bandwidth of the CMFB. 

3.1.3 Differential-Difference Amplifier (DDA) 




Fig. 11: DDA for analog frontends, gain Ay=l +2RyRb 
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The DDA shown in Fig. 11 reduces the number of components compared 
with the classic instrumentation amplifier. It amplifies a difference between 
the two input voltages. 




Fig. 12: Schematic of a two- stage DDA 

Fig. 12 shows the schematic of a DDA. The two input voltages are 
converted to currents by the two differential pairs formed by the transistor 
pairs Tla and Tib. Then they are subtracted in the current-summing nodes. 
This is followed by a standard folded-cascode amplification stage. The 
DDA uses a feedback loop to define the common -mode voltage at the 
output of the amplifier. Fig. 12 also reveals the disadvantage of the DDA: 
The input stage has to handle the full differential input-swing of the sensor 
signal and, therefore, good linearity can only be achieved for small input 
signals. 



3.1.4 Open-Loop Amplifier 

The preceding amplifiers all employ OpAmps with large DC gain and 
feedback. The gain of these amplifiers is accurately defined by the ratio of 
linear feedback elements (resistors, capacitors), and the linearity is 
determined by the excess open-loop gain. This comes at the cost of extra 
power needed to drive the feedback resistors, to bias the second 
amplification stage, and to realize a large bandwidth. 

A different approach is to use amplifiers in which the gain is defined as the 
ratio of the transconductance (gm) of two transistors rather than by using 
feedback. 
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Fig, 13: Schematic of gm-ratio amplifier 



Fig. 13 shows an example a gm-ratio amplifier [17, 18], where the gain is 
given by 



A,= 



gm^ 






4/?/ 



( 1 ) 



This amplifier is only suitable for small input signals. The input swing is 
determined by the linear input range of the differential stage. Further 
reduction of the power consumption (by a factor of 2.5 for the same 
performance) can be achieved by operating the transistors in weak 
inversion, where the gain is proportional to the ratio of the bias currents 
[17]. 



3.2 Analog-To-Digital Converters (ADCs) 

A general discussion of ADCs would exceed the scope of this article. A 
good overview of integrated ADCs can be found in [19]. This section 
analyzes the specific issues that affect the choice of the ADC for multi- 
sensor systems. ADC architectures and their capabilities are compared in 
[20, 21]. Low-power designs for sensor applications have been realized in 
many different architectures, e.g., SA-ADCs [22-24], successive-approxi- 
mation ADCs [25], and algorithmic ADCs [26]. 

3.2.1 ADCs for Sensor Arrays 

There are two possible approaches for sensor arrays with a large number of 
elements; either one fast ADC is used to convert the multiplexed signals 
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from all elements or each individual element employs a slow ADC. In the 
latter case of a cell-level ADC, small area and low power consumption are 
the most important features. This makes single-slope ADCs and dual-slope 
ADCs an attractive choice [27]: 

• A m-bit single-slope ADC consists of only a small number of 
elements (comparator, integrator, m-bit latch, m-bit counter, ramp 
generator) 

• They are inherently monotonic and linear. 

The counter and the ramp generator can be shared between all cells, thus 
reducing the size of the ADC. In [28], Yang et al. developed a modified 
conversion algorithm (novel ramp generator) that also eliminates the need 
for a latch inside the cell. As a result, four detectors and the ADC of their 
320 X 256 pixel CMOS imager (0.35-pm technology) fit into a 20 x 20 pm^ 
cell. A 8-bit first-order EA ADC (cell area 30 x 30 pm^) [29] and a 13-bit 2- 
step ADC (230 x 230 pm^) [30] have been presented for applications in 
which the area is less critical. 

The use of cell-level ADCs allows data to be processed already in the sensor 
array. This reduces the amount of data that has to be transmitted off-chip 
and can eliminate the need for additional data-processing hardware. 



1 / =; citik -fl. 




a) 




b) 



Fig. 14: Fingerprint sensor with cell-level ADC and on-chip data 
processing; a) pixel circuitry and threshold detector, b) basic structure of 
processing cell with its six neighbors; from [31], © 1999 IEEE 
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Fig. 14 shows a cell of the CMOS fingerprint sensor realized by Jung et al. 
[31]. They use a capacitive sensing scheme to detect the capacitance 
between an on-chip sensing electrode and the finger. 

A threshold detector is used to distinguish between a ridge and a valley (see 
Fig. 14a). The cell-level ADC enables on-chip filtering (thinning) and 
extraction of features (ridge endings). Fig 14b shows the additional cell- 
level circuitry needed to perform the filtering operations. Each pixel uses 
the output of the six neighboring pixels to perform the filtering operations, 
which are initiated by global control circuitry. This way, no external signal 
processing hardware is needed. 

If a single ADC is multiplexed between all elements (or shared for one 
row), the size of the converter is not critical. For most sensors, the sampling 
rate is rather low (between a few samples per second and 100 ks/s), which 
leads to an overall sampling rate below 100 Ms/s for an array with 1000 
elements. Therefore, in most cases well-known high-speed ADC 
architectures, such as flash converters, pipelined converters, and folding 
converters, are used. 



3.2.2 Single-Bit Sigma-Delta ADCs 

For small sensor arrays and single-chip combinations of multiple sensor 
interfaces, single-bit SA-converters are the most popular choice for the ADC 
because of the following features: 

• High accuracy can be achieved without reference-voltage scaling; no 
accurate matching of resistors or capacitors is required. 

• No calibration required. 

• They are faster than single-slope and dual-slope converters. 

• They are strictly monotonic. 

• They contain only a small number of elements (small area). 

The specifications of the anti-aliasing filter are relaxed because of the 
oversampling. 



first integrator second integrator comparator flip-flop 




Fig. 15: Signal-flow graph of second-order ZA modulator 
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Fig. 15 shows the signal-flow diagram of a second-order SA modulator. 
The output Y of the converter is given by 

Y = z~^^^*(X+Q*(l-z~'f) (1) 

where Q is the quantization error added by the comparator. The signal- 
transfer function Y/X for DC signals is unity and the SNR increases by 
roughly 40 dB for each tenfold increase of the sampling rate. Fig. 16 shows 
a fully-differential switched-capacitor implementation of this SA-ADC. 




Fig. 16: Fully differential second-order switched-capacitor FA modulator 

This SA modulator requires only a single-ended reference voltage Vref, 
which is defined relative to the common-mode voltage (analog ground in 
Fig. 16). The basic architecture of the SA modulator can easily be adapted to 
perform additional tasks: 

• By changing the oversampling factor and/or the decimation filter, 
resolution can be traded for conversion rate and power consumption 
without modification of the converter. 

• The sensor element can be incorporated into the modulator, e.g., for 
capacitive sensors the sensing capacitor replaces the input capacitor 
Cin. The same principle has also been applied to resistive and 
inductive sensors. 

• Reference elements that compensate for sensor offsets can also be 
incorporated. 

• The generation of the reference voltage or compensation signals can 
be performed inside the modulator. This approach has been used in 
[16] to design a SA temperature sensor that generates the PTAT 
voltage as well as the temperature-independent bandgap reference 
voltage internally. 
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3.3 Digital Interfaces 

Based on the template for multi-sensor systems in Fig. 3, two digital 
interfaces are required: 

• A slow digital interface to set parameters and operation modes, to 
perform calibration, and to read status information. 

• A fast digital interface for data transmission. 

Serial interfaces are a common choice because they require only a small 
number of 10 connections. Numerous different serial interfaces have been 
described in the literature. Over the past few years, some standards for 
sensor interfaces have been established, and some interfaces are widely used 
in certain areas of application: 

• IEEE 1451.1-4 [32]: This standard was established only a few years 
ago and is not yet widely used. The large number of connections (10 
10 ports), the lack of bus capability, and the need for on-chip memory 
to store the “Transducer Electronic Data Sheet (TEDS)” render it 
difficult to use for integrated sensor interfaces. 

• SPI bus: Widely used in the USA; included with many 
microcontrollers. 

• I2C bus: Developed by Philips, parts of it are proprietary; widely 
used in Europe (with small modifications). 

• CAN bus: The standard bus interface used in the automotive 
industry. 



4. Conclusions 

Many sensor applications require combined information from multiple 
sensors or sensor arrays to perform multi-parameter analysis or on-chip 
compensation of cross-sensitivities. Together with the trend to integrate 
more components of a system on a single chip (system on a chip, SOC), this 
leads to complex systems. The architecture has to be selected carefully with 
respect to the number of 10 connections, fabrication technology, packaging, 
calibration and test capabihties, processing options, and power 
consumption. The design of multi-sensor interfaces requires analog frontend 
circuits that can operate reliably despite large substrate noise and without 
accurate external reference signals. 
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Abstract 

Sensor arrays for fully electronic DNA detection based on an 
extended CMOS process are discussed. An introduction to the 
used DNA detection method and to the modified CMOS process 
is given, and special emphasis is put on design issues of circuit 
blocks whose function is particularly related to the 
electrochemical operation principle of the chip. Stability and 
accuracy aspects are considered in detail. Measured data are 
shown from electrical characterizations as well as from 
biological experiments. 



1. Introduction 

In many fields of biotechnology tools are required to detect the presence or 
absence of specific DNA sequences in a given sample. The development of 
so-called DNA microarrays in recent years has opened the way to high 
parallelism and high throughput [1-9]. The most widely known applications are 
genome research and drug development. Moreover, exploration of these tools in 
the field of medical diagnosis is under development. While today’s available 
platforms utilize optical detection principles, electronic readout techniques 
provide a number of advantages, since they avoid expensive and comparatively 
complicated optical set-ups so that they open the way to user-friendly and robust 
systems. However, the status of development is lower compared to optical 
systems. 

In 2002, the electrochemical redox-cycling detection method [10, 11] was 
applied to chips fabricated on the basis of an extended CMOS technology. In 
this article, circuit design issues are reviewed which are specifically related to 
that method. 
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The article is organized as follows: In sections 2-4, brief introductions are given 
concerning the operation of DNA microarrays in general, the redox-cycling 
detection method, and the extended CMOS process used. In section 5, design 
issues are discussed concerning an on-chip potentiostat which is required in all 
cases where electrochemical methods are applied. In sections 6 and 7, circuits 
for the realization of active sensor sites are considered providing an analog data 
transmission concept through the whole array in the first case, and a sensor site 
approach with digital output in the second case. Finally, in section 8, the article 
is summarized and a brief outlook on fiiture developments is given. 



2. Basic Operation Principle of DNA Microarrays 

The purpose of DNA microarrays is to enable the parallel investigation 
of a given sample concerning the presence of specific DNA sequences. 
Depending on the application, requirements range from relatively simple 
’’presence or absence” investigations to quantitative analyses with a high 
dynamic range. 

A DNA microarray chip is a slide made of glass, a polymer material, 
Silicon, ... . On the surface of such a chip, single-stranded DNA receptor 
molecules (’’probes”) are immobilized at predefined positions within an area in 
the order of square millimeters to few square centimeters (Fig. la)). The probe 
molecules consist of different sequences of typically 20 - 40 bases. Various 
techniques are used to functionalize the chips: off-chip synthesized probe 
molecules can be deposited on the surface of such chips using microspotters [12] 
or moved to the required chip position using a dielectrophoretic technique [13]. 
Moreover, on-chip in-situ synthesizing methods exist using a lithography-based 
mask technique or light or electronic potential triggered alternatives [14-16]. 
Throughout this article, chips are spotted for functionalization. 

In Figs. Ib) and Ic), two different sites within the array are considered after the 
immobilization phase. For simplicity, only five bases are drawn in this 
schematic illustration. As shown in Figs. Id) and le), in the next step the whole 
chip is flooded with a sample containing the ligand or target molecules. In case 
of complementary DNA sequences of probe and target molecules, their match 
leads to hybridization (Fig. Id)). If probe and target molecules mismatch, this 
chemical binding process does not occur (Fig. le)). Finally, after a washing step, 
double-stranded DNA is obtained at the match-positions (Fig. If)). At the 
mismatch sites single-stranded DNA molecules remain so that the same situation 
as in the initial case shown in Fig. Ic) is achieved again. 

Since the receptor molecules are known, the information, whether double- or 
single-stranded DNA is found on the different test sites, reveals the composition 
of the sample. Thus, the remaining task is to identify the sites with double- 
stranded DNA. In the commercially available systems using state-of-the-art 
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Fig. 1: a) Schematic plot of a DNA microarray chip, b), c): Schematic 
consideration of two different test sites after the immobilization process. For 
simplicity, probe molecules with five bases only are shown here, 
d), e): Hybridization phase. A sample containing target molecules to be detected 
is applied to the whole chip. Hybridization occurs in case of matching DNA 
strands (d). In case of mismatching molecules (e), chemical binding does not 
occur.fi, g): Situation after subsequent washing step. 

optics-based readout techniques, the target molecules are labeled with 
fluorescence markers [1-9]. An optical image of the fluorescence signals from 
the array chip - taken after the hybridization and a subsequent washing step - 
reveals the positions with double-stranded DNA. 






68 



3. Redox-Cycling Detection Method 

The electrochemical sensor principle used here is based on an electrochemical 
redox-cycling technique [17-20]. A single sensor (Fig. 2, left) consists of 
interdigitated gold electrodes (generator and collector electrode). Width 
and spacing of the gold electrodes are of order 1 pm. Probe molecules are 
immobilized on the gold surface (e.g. by Thiol coupling). The target 
molecules in the sample are tagged with an enzyme label (Alkaline 
Phosphatase). After hybridization and washing phases, a suitable chemical 
substrate is applied to the chip (para-Aminophenylphosphate). The enzyme 
label - only present in case of matching DNA strands - cleaves the 
phosphate group and electrochemically active para-Aminophenol is generated 
(Fig. 2, right). 

Applying simultaneously an oxidation and a reduction potential to the sensor 
electrodes (Fge„ and Fco/, e.g. +300 mV and -100 mV with respect to the 
reference potential), para-Aminophenol is oxidized to Quinoneimine at the one 
electrode, and Quinoneimine is reduced to para-Aminophenol at the other one. 
The activity of these electrochemically redox-active compounds directly 
translates into electron currents at the electrodes (/ge„ and Ico/)- 
Since not all particles oxidized at the generator reach the collector electrode, a 
regulated four electrode system is used; a potentiostat, whose input and output 
are connected to a reference and to a counter electrode, respectively, provides 
the difference currents to the electrolyte, so that the potential of the electrolyte is 
held at a constant value. 

The current flow at the sensor electrodes results from the contribution initially 
generated by the enzyme label, and from the redox-cycling related contribution 
at the sensor electrodes. Due to electrochemical artifacts an offset current may 
contribute to the detection current, so that usually the derivatives of the sensor 
current with respect to the measurement time, dlcoi / dt and dlge„ / dt, are 
evaluated instead of the absolute values. A more detailed discussion of this 
method is given elsewhere [17-20]. 



4. Extended CMOS Process 

The basis of the extended process is a 6" n-well 0.5 pm CMOS process with two 
metal layers, high-ohmic poly-silicon resistors, and poly-poly-capacitors. Oxide 
thickness is 15 nm, and supply voltage is 5 V. After standard CMOS processing, 
the gold electrodes are fabricated in a lift-off process after evaporation of a 
Ti/Pt/Au stack with layer thicknesses of 50 nm / 50 nm / 300-500 nm, 
respectively. For the contact pads, gold metallization is used, too. Fig. 3 shows a 
tilted SEM cross section of a fully processed chip with sensor finger electrodes 
and CMOS elements. 
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Fig. 2: Schematic plot showing the redox-cycling sensor principle and the 
sensor layout. Left: Single sensor consisting of interdigitated gold electrodes 
and a potentiostat circuit with counter and reference electrodes. Right: Blow-up 
of a sensor cross-section showing two neighboring working electrodes after 
successful hybridization. For simplicity, probe and target molecule are only 
shown on one of the electrodes. 




Fig. 3: Tilted SEM cross 
section photograph with 
sensor electrodes and 
CMOS elements after the 
complete process run. Note 
that the nitride layer on top 
of the sensor electrodes is 
only used for preparation 
purposes. Width and 
spacing of the sensor 
electrodes are 1 pm. 



Specific annealing steps are introduced to guarantee both, good electronic 
CMOS frontend properties and good sensor properties in the process backend. 
As discussed in detail in [11, 21], an optimized process window was identified 
which allows to meet both requirements. 
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Fig. 4: Top view of a sensor 
with interdigitated gold 
electrodes embedded within 
a Polybenzoxazole ring used 
as comparetment. 
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Fig. 5: Chip photo of a fully processed demonstrator chip with 16 X 8 positions. 
Sensor diameter = 200 pm, sensor pitch = 300 pm, total chip dimensions = 
4.5 mm x 6.4 mm. 



Optionally, a compartment (Polybenzoxazole) is available surrounding the 
sensor area of each test site to avoid contamination of neighboring sensors 
during the spotting process. A top view of a sensor with interdigitated gold 
electrodes embedded within a compartment ring is depicted in Fig. 4. 
Fig. 5 shows an example of a fully processed demonstrator chip with 16 x 8 
positions. 
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5. Potentiostat 

In this section, circuit design aspects [22-24] are discussed concerning the 
potentiostat circuit required for the redox-cycling detection technique as 
explained in section 3. The complete regulation loop consists of elements 
depending on the electrochemical properties of the sample which determine the 
impedances of the electrodes (cf Fig. 8 and related discussion). A mandatory 
requirement for the potentiostat is loop stability. A minimum phase margin of 
70° is required under all operation conditions. For that reason we are aiming for 
a single-stage amplifier with sufficient gain and choose a folded cascode 
amplifier as shown in Fig. 6. 

This circuit is usually operated to drive capacitive loads with AC signals. Here, 
however, its DC current drive capabilities must be taken into account. To 
determine this parameter, we consider organization and architecture of the 
related DNA chip with 128 test sites (Fig. 7). Each of the 8 x 16 sensors is 
specified to provide sensor currents between 1 pA and 100 nA. Measured data 
for our electrode geometries (width = spacing = 1 pm) and for our redox system 
reveal a collection efficiency [18], 

C6 ^col / ^gen ? ( ^ ) 

with ce > 0.85 [18], so that maximally 15% of the maximum generator electrode 
current must be provided by the potentiostat via the counter electrode in order to 
keep the potential of the electrolyte at a constant value. This value amounts to 
approximately 128 x 0.15 x 100 nA = 2 pA. This current is taken from the bias 
current through the output branch of the folded cascode amplifier. To ensure 
negligible disturbance on the overall amplifier performance, we chose a 
sufficiently large biasing current in this branch, so that the maximum borrowed 
fraction is smaller than 1%. 

For a complete description of the regulation loop an equivalent circuit 
considering the electrochemical processes at the electrode-to-electrolyte 
interface is required. We describe a DC model first and then derive an AC 
model to investigate loop stability (Fig. 8). 

Current flow across the metal-to-electrolyte interface originates from a 
displacement current via the double-layer capacitance and from a faradaic 
process. The I-V characteristics of the faradaic current cannot be represented by 
simple linear circuit elements, but by a voltage dependent current source in 
parallel with a voltage dependent resistance [18]. We use this approach to model 
the behavior of working and counter electrodes. For the reference electrode the 
current source in parallel with the resistance is replaced by a voltage source in 
series with a resistance. 

The small signal open loop AC model is obtained by cutting the DC loop at the 
inverting input of the amplifier. The amplifier is then described in terms of its 
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Fig. 6: Folded cascade amplifier used as potentiostat circuit here. 




« 0 



Fig. 7: Chip architecture considered for the potentiostat design. 



small signal parameters transconductance gm,op, output resistance rout, op, and 
output capacitance Cout.op^ The current sources and the voltage source used to 
model the electrodes turn into opens and a short, respectively, and the voltage- 
dependent resistances are substituted by the derivatives of their V-I 
characteristics at the bias point under operation. The resulting equivalent circuit 
is depicted in Fig. 8 (bottom). 

The calculated transfer function is too complex to be discussed in all details 
within the context of this overview. It has two zeros and three poles, so that the 
loop is not unconditionally stable. The parameters which determine the transfer 
function are grouped into three categories: 
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Fig. 8: Top: DC model of the electrochemical system composed of working 
electrodes and potentiostat regulation loop. Bottom: AC Small-signal open loop 
model. 



• Circuit design dependent parameters 

The opamp parameters gm,op and rouwp are chosen to provide a high open loop 
gain of the amplifier at low frequencies. Couwp results from the parasitic 
capacitances at the opamp’s output node. 

Since the input offset voltage of the amplifier is a function of the area of the 
input transistors [25, 26], matching requirements determine their widths and 
lengths and consequently also the value of the input capacitance C/„. The 
capacitances of counter and reference electrode {Cent and C^/, respectively) 
are proportional to the electrode areas, which represent free design 
parameters. 

• Parameters defined by the sensor design 

Sensor design, i.e. design of width, spacing, and area of the sensor electrodes 
depends on technology design rules and on the required magnitude of the 
minimum sensor signal levels. Moreover, microfluidic and functionalization 
aspects must be taken into account by defining their minimum area and pitch. 
Under these boundary conditions, we treat these parameters as fixed values. 
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• Electrochemical parameters 

The values of the small signal resistances of working electrodes, counter and 
reference electrode, rgen, rcoh ^cnh and r,^/, respectively, depend on 
electrochemical parameters of the electrolyte and on the concentration of 
the generated electrochemical products. These time-dependent parameters 
are only roughly known, and estimation accuracy extends over a couple 
of decades, so that careful consideration within a wide range of 
values is demanded to guarantee stable operation under all possible 
conditions. 

The values of the double-layer capacitances of all electrodes are only weakly 
dependent on the concentration of the electrolyte and can be estimated with an 
accuracy of one decade. 

To evaluate the effect of variations of the electrochemical parameters and to 
guarantee stability under these boundary conditions, we start with a set of 
nominal values for the parameters adjustable by design (gm,op, C out, op, and 

rout.op) and reasonable assumptions concerning the range of the electrode 
geometry dependent parameters, Ccd, Cent, Cref, r^en, rcoi, rent, and 

respectively. The phase margin is calculated by simultaneously varying different 
combinations of two parameters within a large range while allocating nominal 
values to the other parameters. The identified critical cases are depicted for three 
combinations of parameters in Fig. 9. The contour line indicates 70° phase 
margin. Projection of this line in the parameter plane identifies the region of 
save and stable operation. 

These results can be understood considering the poles and zeros of the transfer 
function. The amplifier causes a first pole while the counter and working 
electrodes are responsible for a second pole and a zero. The third pole and the 
second zero originate from the contribution of the reference electrode. They are 
given by the following simple expressions: 



fzeroj'ef ^ ref } 


( 2 ) 


fpole_ref {^^^ref ^ref } 


( 3 ) 


Under condition 




C;„ « Cref 


( 4 ) 



the related pole and zero compensate for each other so that a system with only 
two poles and one zero remains. Using the same procedure as described before 
but within a parameter room with reduced dimension {rent, rgen, reoi, rrej), stability 
is re-evaluated again. A minimum phase margin of 136° is obtained in our 
system in case condition (4) is fulfilled. 
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Fig. 9: Calculated phase margin for 
different combinations of system 
parameters: light gray = stable, dark 
gray = critical. 



6. Analog Sensor Site Circuit 

In this section, a fully analog sensor 
site circuit is discussed providing 
100-fold amplified sensor currents at 
its output. The chip architecture, 
where this approach is used, is the 
same as in Fig. 7. Sensor site 
selection is done by column and row 
decoders and a multiplexer. A 
common potentiostat circuit as 
discussed in section 5 is used for all 
sensor sites in parallel. 

The sensor circuit is depicted in 
Fig. 10. It consists of two regulation 
loops to control the potentials of both 
electrodes. The currents of generator 
and collector electrode are 
considered. They are amplified using 
two current mirrors in series within 
each branch. According to the 
specifications mentioned above and 
aiming to cover a wide range of 
possible applications, the circuit is 
designed to be operated with sensor 
currents from 1 pA to 100 nA. 

The regulation loop consists of a 
simple folded cascade operational 
amplifier and a source follower 
transistor. Due to the high dynamic 
range of five decades in current, the 
latter device is operated in the 
subthreshold regime in most cases. 
Stability is obtained by introduction 
of frequency compensation 
capacitors, Q, between the output of 
the operational amplifier and the 
source of the source follower 
transistor (cf. Fig. 1 1). 

In the following, we consider the 
relative accuracy of the pixel transfer 
characteristics. The transfer 
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Fig. 10: Analog sensor site circuit. 
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Fig. 11: Bode plot of the regulation loop for sensor currents between 1 pA and 
100 nA. 



characteristics of each pixel can optionally be calibrated (and characterized) by 
forcing reference currents into both branches of the circuit through transistors 
Mn51 and Mp51 (i.e. at the same nodes, where the sensor currents are applied) 
and measuring the circuit response (output current) for different operating 
points. 

The variation of the transfer characteristics is determined by the variation of 
the current gain of the current mirrors (Mnl 1-Mnl4, Mpl 1-Mpl4, Mn31-Mn34, 
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Mp31“Mp34). The variation of the current gain mainly originates from the 
variation of the transistors’ threshold voltages, which can be modeled as 
[25, 26] 




( 5 ) 



with <5vt being the standard deviation of transistor threshold voltage as a function 
of its area (width W x length L). The parameter Avt is the threshold voltage 
related matching constant. For the considered process, Avt equals 14 mVpm for 
both n- and p-MOS transistors [27]. 

The operation point of the current mirror devices varies from subthreshold 
region to inversion. In the subthreshold region, the transistor drain current. Id, 
can be modeled as 



( 6 ) 

whereas under strong inversion conditions 

lD=\'xk><Y^{VG-Vtf (7) 

holds. There, lo is a process and device type (n- or p-MOS) dependent constant, 
Vg the transistor's gate voltage, S the transistor's subthreshold slope (a constant 
weakly dependent on the process and the device t}^e), and k a process and 
device type (n- or p-MOS) dependent constant. 

Since the devices Mnl3 and Mnl4, Mpl3 and Mpl4, Mn33 and Mn34, and 
Mp33 and Mp34, only ensure, that the devices Mnll and Mnl2, Mpll and 
Mpl2, Mn31 and Mn32, and Mp31 and Mp32, are operated at similar drain 
voltages, the main contribution to the mismatch of the current gain is provided 
by the latter group of devices. 

Considering the collector electrode related part of the circuit as an example and 
using eqs. (6) and (7), we calculate the gain of the two current mirrors 
Mnl 1-Mnl4 and Mp31-Mp34 in series 

Iput _ ^Mn\2 y ^Mp32 . f.((^f^t,Mnl2,Mnl 1 > )+(A'7,Atp32,A^31 ^ )) 

hal ^MnU ^Mp3l 

under subthreshold operating conditions, and 
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^out _ ^Mn\2 ^Mp32 ^ ^^t,Mp32,Mp3\ 

leal Wm„u L ^G,n-^l,n ^G,p~^t,p j 



under inversion operating conditions. The parameters S„ and Sp stand for the n- 
and p-MOS transistor’s subthresold slopes, AVt,Mni 2 ,Mnii and AVt^MpsiMpsi stand 
for the threshold voltage difference of devices Mnll and Mnl2 and Mp31 and 
Mp32, Vt,n and Vt,p stand for the mean values of n- and p-MOS transistor 
threshold voltages respectively. Length L is constant for all devices, parameter 
variations of W and L can be neglected. 

The relative error of the gain of the two current mirrors is given by 



out ^ ^ cal ) _ I ^ I ^ p)) 

^out ! ^ cal 



( 10 ) 



in the subthrehold regime, and by 



out ^ ^ cal ) 
^out ^ ^cal 




( 11 ) 



in the inversion regime, with a being a process and design dependent constant. 
The parameter I^ai stands for the test current forced into the circuits for the 
calibration or characterization procedure. 

Eq. (11) is based on the relation, that in eq. (9) the denominator in the last two 
terms in brackets is proportional to the square root of the transistor current 
following eq. (7). 

An important result of these considerations is that the gain mismatch is 
independent of the transistors’ gate voltages as long as the operating point is in 
the subthreshold regime. As a consequence, the gain error is constant 
independent of the sensor current resp. the circuit’s output current in this region. 
Under inversion region operating conditions, the gain mismatch is proportional 
to (sensor current)"^^^. 

On this basis, the gain error can be easily modeled for the whole operating 
range: In Fig. 12, the gain error is schematically plotted as a function of 
(7ca/) The behavior can be approximated by two straight lines: one in the 
inversion region increasing in proportion to and the other one in the 

subthreshold region with zero slope. In the weak inversion regime, i.e. where the 
transistors’ operation points change from strong inversion to subthreshold 
region, a transition of the two lines into each other occurs. 

These relations allow easy calibration of the circuit and of the whole array. 
Using two values for the calibration current, the behavior in the inversion 
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Fig. 12: Schematic plot depicting the gain error as a function of 
(calibration currentf’^^ induced by parameter variations of the current mirror 
transistors. 



regime can be determined. These currents, labeled "A" and "B" in Fig. 12, must 
be chosen in the upper region of the range of specified sensor currents. In the 
subthreshold regime, one calibration data point (labeled "C" in Fig. 12) is 
sufficient. The value of this current may for example lie in the middle or in the 
lower half of the specified current window. Storing the measured output currents 
for these three calibration points for each test site, the whole array can be 
calibrated. 

In Fig. 13, measured transfer and gain characteristics as a function of the input 
current of all test sites from a 16 x 8 chip are shown for the generator branches. 
The arrows indicate the device mismatch induced variations. The measured data 
for these variations are in excellent agreement with the theoretically predicted 
values on the basis of the known standard deviation of the transistor threshold 
voltages from this CMOS process. 

In Fig. 14, the result of an oligo experiment is plotted, obtained with the circuits 
discussed so far. In this experiment, on one part of the sensor sites single- 
stranded DNA oligo molecules are immobilized which are exactly 
complementary to DNA oligo sequences in the sample. On the other part of 
sensor sites non-specific mismatch sequences of probe DNA strands are 
immobilized. The experiment is performed on wafer-level with a fluidic contact 
provided via a flow-cell positioned above the chip under test. 

In the figure, the electrode currents and their derivatives are shown for one of 
the positions with matching strands and for one of the positions with mismatch 
sequences. As can be seen, the absolute values of collector and generator current 
and the absolute values of their derivatives look very similar. Match and 
mismatch clearly lead to different amounts of currents and even to derivatives 
with opposite signs withi n the evaluation time window. 
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Fig. 13: Top: Test site circuit output current vs. input (calibration) current of all 
generator test sites from a 16 x 8 chip. Bottom: Measured gain vs. input 
(calibration) current of all test sites from a 16 X 8 chip. 



7. Sensor Site Circuit with In-Site A/D Conversion 

Using the same detection principle as above, in this section another DNA 
sensor array chip is presented with analog-to-digital conversion realized within 
each sensor site [28, 29]. The approach provides the same dynamic range 
of five decades as discussed in sections 5 and 6. The digital sensor site 
circuit output, leads to increased robustness of the data transmission 
within the array. Moreover, it allows to sample the data from all sites 
simultaneously which is in particular advantageous for chips with a large 
number of test sites. 
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Fig. 14. Top: Measured sensor currents for one position with matching strands 
and one position with mismatching sequences. Bottom: Derivatives of the sensor 
currents with respect to the measurement time. The evaluation time slot is 
emphasized with a gray background. 



The concept is based on the realization of a specifically adapted analog-to- 
digital converter within each sensor site. The principle is shown in Fig. 15. The 
voltage of the sensor electrode is controlled by a regulation loop via an 
operational amplifier and a source follower transistor. For A/D conversion, a 
current-to-ffequency converting sawtooth generator concept is used, where an 
integrating capacitor C;„, is charged by the sensor current. When the switching 
level of the comparator is reached, a reset pulse is generated which passes 
through the delay stage and the capacitor is discharged by transistor M^es again. 
The delay stage is required as a pulse shaping unit to ensure a sufficient length 
of the reset pulse applied to Mres so that complete discharging of the integrating 
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Fig. 15: Schematic circuit principle used for in-sensor site A/D-conversion 
based on current-to-frequency conversion. 



capacitor is guaranteed. The number of reset pulses is counted with a digital 
counter. 

The complete test site circuit is depicted in Fig. 16. It consists of two 
complementary parts for the generator and collector electrodes, respectively. 
Note, that at the circuit node marked with an asterisk, a current can be forced 
into the circuit for calibration or test purposes (similar as in Fig. 10). The 
number of pulses is counted with a 24 stage counter within both branches. Since 
these counters consume a relatively large portion of the test site area, a design is 
used which allows to convert the counter circuit into a shift register by a control 
signal for data readout [29]. 

In Fig. 17, a straight-forward comparator circuit design based on a Miller type 
opamp is depicted which is used for the generator branch. The reference voltage 
which defines the switching level is buffered by a second differential stage. A 
simple inverter chain is used as a delay line here. A further alternative pulse 
shaping circuit is considered in [28, 29]. 

The frequency behavior as a function of the sensor current including parasitic 
and device mismatch effects is described by the following equation: 



7 



^offset ^ 
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Fig. 16: Complete sensor-site circuit diagram with digital output. 




Fig. 17: Comparator circuit used for the generator branch in Fig. 16. 

There, / is the oscillation frequency, Vsw the comparator switching level, Vo/fset 
the comparator input offset voltage, C the total capacitance (i.e. + parasitic 

capacitances), leiectrode the sensor electrode current, feak the leakage current 
(e.g. due to transistor junction leakage) at the circuit node connected to the 
comparator input, and tdeiay the comparator delay time. An approximation for the 
measured frequency is given by: 



f ^electrode / (-^sw ^ ^int ) 



(13) 
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Fig. 18: Count rate as a function of the calibration current for the generator 
branches from all 128 sensor sites. (Measured data points per decade = 3). 



The chosen sawtooth amplitude Vsw is 1 V here, C is approximately 140 fF, so 
that frequencies are obtained between 7 Hz and 700 kHz for a sensor current 
range from 10"^^ A to 10’^ A. 

For test purposes, an array with 16x8 test sites is fabricated. Sensor geometries, 
sensor pitches, and chip dimensions are the same as in Fig. 5. The electrical 
behavior of the chips is characterized using the calibration resp. test option for 
the specified range of sensor currents between 1 pA and 100 nA. 

In Fig. 18, the count rate is plotted as a function of the calibration current for the 
generator branches from all 128 sensor sites. Fig. 19 shows the relative deviation 
of the count rate compared to a linear relation following eq. (13) with the given 
values of Vsw and C. 

Three different regions can be distinguished. In the central region between 
10'^^ A and 10'^ A, all pixel circuits follow the simple linear relation with 
deviations below 5%. In the region above 10'^ A, the frequency increase with 
increasing current is lower for two reasons: 1.) Part of the current is 
systematically not considered since the related charge is not integrated during 
the time of the reset pulse, and 2.), the delay of the comparator circuits limits the 
maximum frequency. 

These effects are well understood by simulations, and a correction of the 
measured data can easily be done by the readout software. Moreover, in the high 
current region, range extension can easily be provided by optionally shunting the 
integration capacitor with a second device with a 10-fold or 100-fold value, so 
that the achieved frequencies are lowered by one or two decades. At low current 
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Fig. 19: Relative deviation of the count rate compared to a linear relation 
following eq. (13) for the generator branches from all 128 sensor sites. 




Fig. 20: Standard deviations (3 a values) of the measured count rates for the 
generator and for the collector branch. 



levels leakages of the transistors in the current paths lead to a statistically 
varying behavior of the individual pixel circuits. 

The absolute accuracy of the conversion circuits is determined by process 
variations of the integrating capacitor. From the specifications of the CMOS 
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process an absolute accuracy better than 5% is estimated, which is more than 
sufficient for application of such sensor arrays. The more important feature is 
the homogeneity (i.e. the relative accuracy) of the array. Fig. 20 shows the result 
of an experimental evaluation of this parameter. The 3c standard deviations of 
the measured frequencies are plotted for both electrodes. As can be seen, the 3c 
values are below 2 % in the current range from lO '** A to 2 x 10'^ A. In the 
range from lO " A to 10"’ A, the 3a deviations are below 12 %. At 10"'^ A, 3a 
values between 12 % and 20 % are obtained depending on the considered 
circuit. 

This behavior can easily be understood using the Gaussian formula for error 
propagation. 






(with F being a function of parameters pi, p 2 , P 3 , ... ). Applying this relation to 
eq. (12), the relative standard deviation of the measured frequency is obtained: 
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In the mid frequency resp. mid current range, the variations are dominated by 
the current- and frequency-independent terms in eq. (15), i.e. by mismatch of the 
total capacitance and by the comparator offset voltage. For low currents resp. 
low frequencies, leakage current mismatch leads to a contribution proportional 
to llleiectrode- In the high current resp. high frequency region the delay mismatch 
dominates due to the fact that the contribution of this parameter increases in 

proportion to lelecrode- 

In conclusion, the data from Fig. 20 represent tolerable levels for most 
applications without calibration at a high dynamic range. In case higher 
accuracy is required, calibration of the individual sensor site circuits as 
described above can be performed to compensate for these deviations. 

In order to demonstrate biological measurements, two sensor rows are spotted 
with two different DNA probe sequences. A sample containing target molecules 
matching to one of the probes is applied. A flow cell is used in this experiment 
which covers only 5 columns and 16 rows of the sensor field, so that only 80 
positions are shown in Fig. 21. There, a clear increase in count rate is observed 
at the matching positions (row 8), whereas no response is seen at the non- 
matching positions (row 7) as compared to the uncovered positions. The 
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Fig. 21: Result of a DNA experiment. Row 8: matching probes, row 7: 
mismatching probes, all further positions not functionalized. 



gradient in the column direction originates from microfluidic non-idealities of 
the flow cell. 



8. Outlook and Conclusion 

Circuit design issues have been discussed concerning CMOS-based sensor 
arrays for fully electronic DNA detection. Potentiostat and sensor site circuit 
design have been considered in detail, moreover the used detection principle and 
issues concerning the extended CMOS process have been briefly addressed. 
Experimental results from electronic characterizations and from DNA 
experiments applied to demonstrator chips with 16 x 8 positions have been 
shown. 

A prototype design of such chips with same sensor site circuitry, but 
extended peripheral circuitry (such as bandgap references, D/A-converters to 
provide the required voltages for electrochemical operation on-chip, and serial 
electronic interface to reduce the amount of electrical interconnects down to a 
few pins only) together with an adequate packaging approach is under 
development. 
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Abstract 

WiseNET offers an ultralow-power platform for the implementa- 
tion of wireless sensor networks. This low-power operation is achieved 
thanks to a careful co-design approach of a new medium access con- 
trol (MAC) protocol called WiseMAC and a dedicated duty-cycled 
radio. WiseMAC is based on CSMA with an adaptive preamble sam- 
pling. It is particularly well suited to ad-hoc and hybrid infrastruc- 
ture networks. The 1 regeneration WiseNET transceiver allowed to 
demonstrate the feasibility of integrating ultralow-power transceivers 
in a standard digital CMOS process. It operates in the 434 MHz 
band and consumes only 1 mW from a 1 V supply, while achiev- 
ing a —95 dBm sensitivity for a 24 kb/s data rate with a 10“^ BER. 
A 2“'^ -generation WiseNET transceiver was designed and specifically 
optimized for the new WiseMAC protocol. This new WiseNET radio 
offers dual-band operation, runs from a single 1.5 V battery and op- 
erates downto 0.9 V while consuming only 1.8 mW in receive mode. 
It achieves a —104 dBm sensitivity for 25 kb/s data rate with a 10“^ 
BER. In addition to this low-power radio, the WiseNET system-on- 
chip (SoC) also includes all the functions required for data acquisi- 
tion, processing and storage of the information provided by the sen- 
sor. The WiseNET solution consumes about 100 times less power 
than comparable solutions available today 
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Figure 1 : Lifetime of a single AA battery versus average power consumption using a simple 
model with a capacity of 2.6 Ah and a leakage current of about 30 /iA. 

1 Introduction 

A network of wireless sensors consists of a large number of energy-autonomous 
microsensors distributed in an area of interest. Each node monitors its local 
environment, locally processes and stores the collected data so it can be used 
by other nodes. It shares this information with the other neighboring nodes 
by using a wireless link. Specific features of interest to the end-user can be 
extracted from the different information collected by several nodes by using 
data fusion while this information is propagated to some base station node by a 
multi-hop communication scheme [1,2, 3, 4]. 

Since the nodes are many and since they might be deployed in regions diffi- 
cult to access, they should not require any maintenance. The nodes have there- 
fore to be energetically autonomous and hence the batteries cannot be replaced 
or recharged. In many application scenario the targeted node lifetime ranges 
typically between 2 to 5 years, which sets a drastic requirement on the node 
power consumption. To give a better idea of the power consumption constraint, 
the node lifetime is plotted in Fig. 1 versus the average power consumption, as- 
suming that the node is operated from a single 1.5 V AA alkaline battery. Note 
that the choice of using a single alkaline battery in WiseNET is mainly moti- 
vated by cost. Fig. 1 shows that a lifetime laying between 2 to 7 years requires 
an average power consumption comprised between 10 to 100 /rW. Since the 
power consumption of radio transceivers commercially available today is typi- 
cally in the several 10 mW range, it is clearly impossible to keep the transceiver 
active all the time. Therefore, the only way to reach the required tens of yuW is 
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to have the nodes sleep most of the time by introducing some duty cycling in 
the order of 0. 1 % to 1 % and to keep a low sleep mode current. 

The reduction of the power consumption induced by the communication pro- 
tocols requires the optimization across all the layers, from the physical layer, 
the channel coding, the MAC layer, up to the routing, transport and application 
layers. The MAC layer probably plays the most crucial role in the overall en- 
ergy efficiency of the communication protocols, especially for networks with 
low duty cycling radios. Therefore, the WiseNET project mnning at the CSEM 
started with developing WiseMAC, a new low-power medium access control 
protocol designed for low duty cycle wireless sensor networks. The operation 
of WiseMAC was then carefully analyzed in order to identify the most impor- 
tant radio parameters affecting the power consumption. Since the parameters of 
the radio transceivers available on the market did not meet the expectations and 
could not be modified, a dedicated WiseNET radio transceiver where these pa- 
rameters are optimized for WiseMAC was designed. This co-design approach, 
allowing the overall power consumption optimization by exploiting the intimate 
relation between the MAC layer performance and the radio transceiver parame- 
ters, is truly the heart of the WiseNET solution. 

A microsensor node performs several functions such as: sense environmental 
physical parameters, process the raw data locally to extract characteristic fea- 
tures of interest, store momentarily this information, transmit it to its neighbors 
by a wireless link. The node has also to operate as a relay for implementing 
multi-hop communication, by receiving the data coming from one or several of 
his neighbors, processing it before routing it to the next neighbor towards the 
destination. To perform all these functions, a sensor node includes many sub- 
systems which for minimizing the power consumption and reducing the cost 
can be integrated in a single system-on-a-chip (SoC). Therefore, in addition to 
the radio optimized for the WiseMAC protocol, a complete sensor node SoC, 
including most of the required functions on-chip, was developed. 

The paper starts with a description of the overall WiseNET network archi- 
tecture and the related requirements on the MAC and the radio in Section 2 
followed by a presentation of the WiseMAC protocol in Section 3. General 
considerations on the design of low-power transceivers operating at low- voltage 
are given in Section 4. The 1®* -generation WiseNET radio is then described in 
Section 5, followed by the description of the new generation of WiseNET SoC 
including the optimized radio in Section 6. Some future perspectives and con- 
clusions are then given in Section 7. 
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2 The WiseNET Approach 

2.1 Ad-hoc, Infrastructure and Hybrid Network Architectures 

Wireless networks are used in many ways that lie in between two extremes: 
the infrastructure mode and the ad-hoc mode. In the infrastructure mode, a 
mobile node does not communicate directly with other nodes but only with 
special nodes, the so-called base stations (BS). The latter are linked together 
through a conventional network forming the infrastructure. A mobile node that 
wants to communicate, first registers to one of the base stations that are in direct 
communication range. When a node wants to send a packet to another node, it 
sends it to the BS. If the destination is registered to the same BS, the latter 
transmits the packet directly to the destination node. Otherwise, the packet 
is forwarded through the infrastructure to the BS at which the destination has 
registered. Cellular telephony and paging systems are typical examples of the 
infrastructure mode. This is also the case of wireless LANs using IEEE 802.1 1. 

In the ad-hoc mode, such infrastructure does not exist. Nor do the base sta- 
tions. If the destination is in range of the source, the packet is sent directly from 
the source to the destination node. In the contrary, the source node sends the 
packet to one or more intermediate nodes, which forward the packet to other 
nodes until the destination is reached or some other termination criteria is ful- 
filled. With this architecture, a packet is transported from the source to the 
destination via a number of nodes by successive hops, hence the name of mul- 
tihop transmission. An example of such a network architecture is the one used 
in PicoRadio [2]. 

Both architectures assume that there exists a way to find the route that a 
packet has to follow from its source to its destination. This is the subject of 
intensive research [5,6]. In this paper, it is assumed that nodes are static enough 
so that the problem can be considered as a secondary issue. The infrastructure 
based architecture is very popular for a number of reasons, but in particular for 
its relative simplicity, the fact that the base stations do not have power restric- 
tions (they may be powered on the mains) and a better spectrum usage because 
frequency planning may be performed. On the other hand, ad-hoc networks 
have been advocated for their higher versatility and potentially lower power 
consumption. Since no planning and no infrastructure are required, ad-hoc net- 
works may be deployed very quickly and in remote areas. By exploiting multi- 
hop communication, the transmission range for each hop may be reduced. As 
the losses are decreasing at least quadratically with the distance, it is more in- 
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teresting from the power consumption point of view to use two hops of length 
L than a single hop of length 2L. This is true in principle. However, with the 
current transceiver efficiency, thermal power dissipation dominates over RF ra- 
diated power and the power advantage of multihop transmission becomes less 
attractive. 

Between these two extreme architectures, it is possible to find other hybrid 
solutions. For instance, an infrastructure-assisted architecture would mix both 
approaches. A mobile node that is not in range of a base station may use other 
nodes that are in range to relay the packets to the base station. In this architec- 
ture, a packet may or may not go through the infrastructure depending on the 
location of the source and the destination nodes. Similarly, two or more sep- 
arate infrastructure-based networks may exchange packets through a sequence 
of hops on mobile nodes thus forming a single network. 

An example of such hybrid architecture may be found in a building. Rooms 
may be equipped with wireless sensors for temperature, moisture, gas and fire 
detection. These sensors relay the produced information towards a number of 
base stations located in the building. All these base stations are linked by some 
kind of wired network. The information is thus available at control rooms and 
possibly other remote locations. 

In order to offer a maximum of flexibility for different kind of applications, 
one of the important objective of the WiseNET project is to achieve low-power 
operation for all the three network operation modes described above. 

2.2 The Need for a New MAC 

To reduce power consumption, nodes will remain sleeping as long as there is 
no specific tasks to be undertaken. At some defined time, a sensor node will 
wake up and perform some measurement. Note that this wake-up may also be 
triggered by an external event. The node may then decide to communicate the 
gathered information to some neighbor and send him a message. Unfortunately 
the neighbor may be sleeping to save energy. The node must hence keep send- 
ing the information until the neighbor wakes up and acknowledges the correct 
receipt of the information. On the other hand, a node may need information 
from a neighbor. It may try to send a request message to the neighbor and will 
have to wait until the neighbor wakes up as in the previous case. Alternately, it 
may stay up and wait until the neighbor decides to send the information. These 
two simplistic examples show that, if not properly designed, communication 
will increase significantly the power consumption because listening and emit- 
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ting consumes a lot of power. The challenge is then to minimize the energy 
consumption by reducing, eventually even eliminating, the energy waste caused 
by: 

• idle listening, which occurs when a node waits listening until a message is 
emitted by another node; 

• over-emitting, which happens when a node sends a message whereas the 
destination node is not ready to receive it; 

• collisions that may occur when two nodes emit at the same time with the 
consequence that the messages must be retransmitted later; 

• overhearing, which takes place when a node listens for a message that is 
sent to another node and for which it is not the destination. 

This energy waste reduction is the main objective of WiseMAC, the medium 
access control designed for WiseNET and described in Section 3. 

2.3 Reducing Power Consumption by Codesign 

In the WiseNET project, the objective is to ultimately reach the lowest possible 
power consumption for hybrid networks. Such a goal can only be reached by 
a proper hardware-software co-design approach*. Often, the communication 
protocols are designed on top of existing integrated circuits. They must then 
take into account the special features and limitations of those circuits. Here, it 
was decided to design the radio and the protocol concurrently from the start. A 
few important parameters of the radio have a clear impact on the higher protocol 
layers: 

• the power consumption in receive (respectively transmit) mode; 

• the wake-up time, which is the time necessary to switch from the idle mode 
to the receiveor the transmit mode. During this time the node cannot re- 
ceive while power consumption is the same as in receive mode; 

• the bit and frame synchronization time at reception. The frame synchro- 
nization pattern has an impact on the overhead (its length) but also on the 
frame error rate when frame starts are erroneously detected; 

^ Actually, it is more a radio-MAC co-design 




97 



• the presence of an effective RSSI (Receive Signal Strength Indicator), 
which can be used to improve the reliability of frame detection. It also 
reduces idle listening when it is used as a silence detector; 

• some way to filter incoming packets so that overhearing can be reduced; 

• the time necessary to switch from the receive mode to the transmit mode 
or vice-versa (turnaround times). During this period, no communication is 
possible while power consumption is between the reception and emission 
cases; 

• the receiver sensitivity and the maximum transmit power which have an 
impact on the collision probability and the required number of hops; 

• the power consumption in sleep mode with a running accurate clock which, 
depending on the application, can become the major part of the current 
consumed during the node’s life; 

• the capacity to adjust emission power and receiver sensitivity to reduce 
frame error rate and collision probability. 

Taking into account the special context of sensor networks, a few basic 
choices had to be made. First, it was decided not to minimize the transmit 
power, but rather choose a reasonably high constant value (for example the 
maximum allowed emitting power in the chosen bands, which is 10 dBm for 
the 434 MHz ISM band). The main reason behind this is that nodes usually 
emit rarely and thus the emission energy is not the most important parameter 
to be optimized. Second, the effort was concentrated on reducing the energy 
consumption in receive mode as well as the wake-up time. Since a form of 
preamble scanning is used at the MAC layer, waking-up quickly has a clear im- 
pact on energy consumption. For the same reasons, the RSSI information has 
been carefully elaborated. As a third level of optimization, robust bit synchro- 
nization and packet filtering based on a programmable pattern has been added. 
Both reduce frame losses while the former additionally reduces overhearing. 
The last parameter in the above list was found to be less important in the first 
MAC simulations and has therefore not been implemented since it required ad- 
ditional circuitry. 
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Figure 2: WiseMAC - Minimization of the wake-up preamble length. 



3 WiseMAC - The WiseNET Low-Power Protocol 



The low power MAC protocol developed at CSEM for wireless sensor networks 
is called WiseMAC {ySfireless Sensor MAC) [7]. It is a single channel con- 
tention protocol based on non-persistent carrier sense multiple access (CSMA). 
Non-persistent CSMA has been combined with preamble sampling to mitigate 
idle listening. The preamble sampling technique consists in regularly sampling^ 
the medium to check for activity. All nodes in a network sample the medium 
with the same constant period. Their relative sampling schedule offsets are 
independent. If the medium is found busy, a node continues to listen until a 
data packet is received or until the medium becomes idle again. At the trans- 
mitter, a wake-up preamble of size equal to the sampling period is transmitted 
in front of every data packet to ensure that the receiver will be awake at the 
start of the data portion of the transmission. This technique enables very low 
power consumption when the traffic is very low as it is usually the case in sen- 
sor networks. Using a conventional receiver (i.e. without a second dedicated 
wake-up radio [8]), it provides, for a given wake-up latency, the lowest possible 
power consumption in the absence of traffic. The main disadvantage of this ba- 
sic preamble sampling protocol is that the low power consumption in idle mode 
is coupled with a high power consumption overhead both in transmission and in 
reception, due to the wake-up preamble. In an ad-hoc network, the cost of re- 
ception is not only paid by the intended destination, but also by all other nodes 
overhearing the transmission. The novel scheme introduced by WiseMAC aims 
at reducing the length of this costly wake-up preamble. This scheme consists 
in learning the sampling schedule of the direct neighbors, and exploiting this 
knowledge to use a wake-up preamble of minimized size, as illustrated in Fig. 
2. The sampling schedule of a neighbor is learned, or refreshed, during every 



^In this context, sampling means periodically measuring the received signal strength (RSSI). 
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Figure 3: Average power consumption as a function of the traffic. 

data exchange by piggy-backing in the acknowledgement messages the remain- 
ing time until the next sampling instant. Every node keeps a table of sampling 
time offsets of all its usual destinations up-to-date. Since a node will have only 
a few direct destinations, such a table is manageable even with very limited 
memory resources. 

The duration of the wake-up preamble must cover the potential clock drift 
between the clock at the source and at the destination. This drift is proportional 
to the time since the last re-synchronization (i.e. the last time an acknowledge- 
ment was received). It can be shown that the required duration of the wake-up 
preamble is given by Tp = mm.{‘i6Tc,Tw), where 9 is the frequency toler- 
ance of the time-base quartz, Tw is the sampling period and Tc the interval 
between communications. A transmission is scheduled such that the middle of 
the wake-up preamble coincides with the expected sampling time of the desti- 
nation. Systematic collision situations that could arise by this synchronization 
are mitigated using a wake-up preamble of randomized size. 

Because the length of the preamble is proportional to the interval between 
packets, the overhead of WiseMAC is adaptive to the traffic. WiseMAC can 
hence provide both an ultra-low average power consumption in low traffic con- 
ditions and a high energy efficiency in high traffic conditions. Fig. 3 presents 
simulation results of a lattice multi-hop topology. The radio range is chosen 
such that every node has 8 neighbors. Traffic is flowing through each node with 
the inter-arrival time shown on the x-axis. It can be seen that WiseMAC can pro- 
vide both, a power consumption below 20 //W in low traffic conditions and ap- 
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proach the power consumption of an ideal protocol in higher traffic conditions. 
With an inter-arrival time of 100 seconds, the power consumption amounts to 
25 yuW, which translates into more than 5 years lifetime on a single AA alkaline 
battery. For comparison, Fig. 3 also shows the performance of T-MAC [9], an 
improved version of S-MAC [10], with different duty cycles, of CSMA/CA and 
of an ideal protocol. All protocols have been simulated using the power con- 
sumption and timing parameters of the WiseNET transceiver given in Table 2. 
The power consumption of CSMA/CA is limited at low traffic by the power 
consumed in receive mode, since the transceiver is never switched off. The 
ideal protocol represents the minimum power consumption required to transmit 
the data without any overhead (idle listening, overhearing, collisions, ...). It rep- 
resents the lower bound that low-power MAC protocols should try to approach. 
In the case of the T-MAC protocol, the period of the listen/sleep phases have 
to be chosen. In the absence of traffic, this ratio is actually the duty cycle of 
the transceiver. The three plots corresponding to the T-MAC protocol in Fig. 3 
are only plotted up to the congestion point above which more than 5% of the 
packets are dropped because of congestion. It can be observed that T-MAC 
can either provide a low power consumption in low traffic conditions, or a high 
throughput, whereas WiseMAC can provide both. 

WiseMAC has been designed for multi-hop sensor networks. A single- 
channel contention protocol has been selected to ease self-configuration. WiseMAC 
requires no set-up signaling nor network-wide time synchronization. The com- 
bination of preamble sampling and wake-up preamble length mi ni miz ation pro- 
vides both an ultralow power consumption in low traffic conditions and a high 
energy efficiency in high traffic conditions. Although designed initially for 
multi-hop networks, WiseMAC has proven to be suited for the downlink of an 
infrastructure network as well. It can be shown, in low traffic conditions, that 
WiseMAC provides a lower power consumption than the power save scheme 
specified in IEEE 802.11 and IEEE 802.15.4 [11]. WiseMAC can hence be 
used, in a hybrid network topology, to receive data both from battery powered 
nodes and from energy unconstrained base stations. 

4 Transceiver Design Considerations 

4.1 Low- Voltage and Low-Power Constraints 

In order to reduce the overall size and cost of the sensor node, a single AA alka- 
line battery is used. The circuit has then to operate down to 0.9 V correspond- 




101 




(a) Basic circuit used to evaluate the 
overdrive voltage. 




(b) Inversion factor versus overdrive voltage. 



Figure 4: Maximum inversion factor for a given overdrive voltage. 



ing to the end-of-life voltage of the battery. Because of this low-voltage, the 
operating points of the analog transistors are shifted to moderate or even weak 
inversion, even for those working at RF [12, 13]. The design of circuits operat- 
ing at such low-voltage and low-current density requires a transistor model that 
is valid and accurate in all regions of operations and particularly in weak and 
moderate inversion [12, 13]. The circuits presented below have been designed 
using the EKV transistor model [14]. In this model, the operating points are no 
more characterized by their overdrive voltage Vgt — Vgs ~ Vt, which can be- 
come negative under the threshold voltage and has no more meaning, but rather 
by the so-called inversion coefficient IC defined as 

JCA_^ ( 1 ) 

^spec 

where Id is the drain bias current and I spec — 2nj3U^ is the specific current, with 
(3 = fiCoxW/L, where ji is the mobility, Cgx the gate oxide capacitance per unit 
area and W and L are the width and length of the transistor, respectively, n is 
the slope factor (typically comprised between 1.2 in strong inversion and 1.6 in 
weak inversion) and Ut = kT/q\he thermodynamic voltage. Although no clear 
physical definitions can be identified for the different regions of operations, 
they can conveniently be defined as: weak inversion for IC <0.1, moderate 
inversion for 0.1 < IC < 10 and strong inversion for 10 < IC. 

Considering the generic circuit shown in Fig. 4(a), the maximum overdrive 
voltage is limited by the supply voltage Vdd according to 

Vdd >Vt + Vgt + 2Vsat + Kig 



( 2 ) 
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(b) For a 0.18 /xm process. 

Figure 5 : Frequency limitations of a multi-fingered NMOS transistor versus the inversion factor 
(bottom x-axis) or the gate overdrive voltage Vq — Vt normalized to Ut = kT/q (top x-axis). 
fc is intrinsic channel cut-off frequency, fr corresponds to the unity current gain and fc is the 
maximum frequency for a given voltage gain. The symbols are measured /t. 





where Vt is the threshold voltage, Vsat is the minimum saturation of a current 
source (in the order of 100 mV) and Vgig is the signal swing (typically 50 mV). 
For Vdd — iV and Vt = 0.6 V, the maximum overdrive voltage is only Vct — 
150 mV = 5n • Ut- According to the plots shown in Fig. 4(b), this corresponds 
to a maximum inversion factor IC of about 8 . This clearly confirms that most 
of the transistors have to operate in moderate or weak inversion. 

Operating in these regions is definitely advantageous for the analog base- 
band because of the higher current efficiency (i.e. gm/Io ratio) and the lower 
saturation voltages reached in weak inversion, but it may introduce some fre- 
quency limitations. This can be evaluated by looking at the transit frequency 
/r, corresponding to the frequency at which the current gain is unity. A bet- 
ter figure of merit is fg, corresponding to the frequency at which a common 
source amplifier loaded by an identical stage has a given voltage gain G. For 
all these definitions, the bias point is set at V 5 == 0 and Vd = Vg- These char- 
acteristic frequencies are plotted in Fig. 5 versus the inversion coefficient IC 
for a 0.5 gm and a 0.18 ^m process. Fig. 5(a) demonstrates that 433 MHz and 
even 910 MHz operation is achievable with the 1 V constraint for IC < 10. 
This was clearly confirmed by the 1®* -generation of WiseNET transceiver de- 
scribed below. Fig. 5(b) shows that moving to a 0.18 /rm process allows to 
have the transistor operating in the middle of the moderate inversion and still 
achieve a 10 dB gain at 2.4 GHz. Operating the transistors in the moderate in- 
version clearly offers a good trade-off between high-frequency, low-power and 
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Figure 6: Average power consumption versus data rate. 

low- voltage while achieving a reasonable linearity. 

Low current consumption forces the use of small branch currents and there- 
fore also small transconductance values. Thus, substantially higher impedance 
levels are required than for standard 50 Q, RF design and care should be taken to 
minimize the load capacitance at each node. As a consequence, small devices 
with a small active area have to be used, which affects the design and the per- 
formances. This unfortunately leads to poorly matched devices, as the error in 
current between two identical transistors sharing the same bias can easily reach 
10%. In many circuits, this effect must be reduced by using adequate dc regu- 
lation loops, nicker noise is also higher for small devices, and the noise comer 
frequency, where thermal and flicker noise are equal, can easily reach several 
megahertz for the minimum length devices used in RF circuits. In particular, it 
could significantly degrade the noise figure (NF) of micropower downconver- 
sion mixers. 

4.2 Duty Cycle and Optimum Data Rate 

At first glance it might seem that the smaller the data rate the smaller the signal 
and noise bandwidth. Therefore, for a given sensitivity, a higher system noise 
figure would be acceptable, resulting in a smaller power dissipation. However 
this would be tme if the radio would be activated 100% of the time. But, as 
mentioned above, in order to achieve the desired average power consumption, 
the radio has to be duty cycled. It can be shown that there is an optimum data 
rate for minimum power consumption [15]. The power consumption can be 
roughly divided into a fixed contribution Pq which is mainly due to the fre- 
quency synthesizer and does not depend on the signal bandwidth and data rate, 
and another contribution Pdem which corresponds to the demodulation chain. 
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including the low-noise amplifier (LNA), the mixers and the baseband channel 
filters, and depends on the system noise figure. 

The average power consumption normalized to its minimum value is plot- 
ted versus the data rate in Fig. 6 for the typical parameters shown in the insert, 
where NFq is the minimum achievable noise figure for a given circuit topol- 
ogy (typically in the 5 to 15 dB range), F is a design constant for the same 
circuit topology (typically equal to 40 mW) relating the minimum noise figure 
NFq to the actual noise figure and A is a parameter relating the actual noise 
figure and the data rate [15]. It indicates that the optimum data rate ranges 
from 10 to 100 kb/s offering an average power consumption in the order of a 
few 10 fiW. The optimum data rate for these parameters is Dgpt — 64 kb/s with 
NFopt = 15 dB and Pav,opt — 32 /^W. The sharp power increase observed in 
Fig. 6 at high data rate is due to a larger bandwidth requiring lower NF (becom- 
ing even negative in dB) and therefore higher power consumption to guarantee 
a sufficient SNR. At low data rate, the power increases also since the receiver 
must stay on a longer fraction of the time and the power consumption is then 
fully dominated by the fixed power Pq related mainly to the synthesizer. 

Note that although the model used in [15] is very crude, it gives a first order 
idea and demonstrates that a relatively high instantaneous data rate with a low 
duty cycle gives much better results than a low data rate with a low performance 
receiver but a large duty cycle. Note that, with — 24 kb/s, NFopt = 20 dB 
and Pav,opt — 324 /nW, the receiver described below falls very close to these 
figures. 

4.3 Low-Power Receiver Architectures 

The superheterodyne architecture has been widely used for high-performance 
receivers. However, it requires external channel-select and image-reject filters 
and is therefore not well suited for fully integrated systems. The simplicity of 
super-regenerative receivers, which include only a few blocks running at RF, are 
well suited for low-power applications [16, 17]. However, this architecture can 
only be used with OOK modulation and its selectivity is intrinsically poor. For 
these reasons, super-regenerative receivers are not the best candidates. Low- 
IF [18, 19] and direct-conversion architectures [20,21,22] can achieve much 
better performance at low-power consumption and are well suited for high in- 
tegration. However, low-IF requires a good I/Q matching for a sufficient image 
rejection, which is difficult to obtain with small RF devices. Some calibration 
techniques might be used, but at the expense of more complexity and additional 
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Figure 7: Block diagram of an upconversion transmitter. 

power consumption. Finally, direct conversion remains the most suitable so- 
lution, although it has its own drawbacks, namely a high sensitivity to flicker 
noise and offsets which can saturate the demodulation chain. These effects can 
be largely circumvented by using a frequency shift keying (FSK) modulation 
with a large modulation index, in order to avoid very low demodulated frequen- 
cies [23]. This guarantees enough room for rejecting flicker noise and offsets 
with a simple high-pass filter, without degrading the transmitted information it- 
self. Note that direct conversion is also interesting in terms of power consump- 
tion, since it requires very few internal nodes operating at RF, merely at the low 
noise amplifier (LNA) and local oscillator (LO) outputs. Moreover, the only 
components required off-chip are the input and output matching networks, pos- 
sibly one or two inductors for an LC-tank voltage controlled oscillator (VCO), 
a quartz resonator as the frequency reference, and a few decoupling capacitors. 



4,4 Low-Power IVansmitter Architectures 

The FSK modulation can mainly be realized either by directly modulating the 
voltage-controlled oscillator (VCO) signal (direct modulation) or by upconver- 
sion. These two possibilities are briefly discussed below. 

The most straightforward solution consists in pulling the instantaneous fre- 
quency of the local oscillator (LO). This can be done either by changing the 
reference frequency, by altering the ratio of the frequency divider, or by directly 
modulating the control voltage of the LC-tank VCO varactors. The first solution 
is not practical because of the low “pullability” of the quartz oscillator, usually 
about 50 ppm. The second solution puts very strong constraints on the fractional 
prescaler, which should allow small frequency steps while keeping a sufficient 
phase-locked loop (PLL) bandwidth and a low power consumption. Finally, the 
direct modulation of the varactors requires a very low PLL bandwidth. More- 
over, the effective frequency deviation might be difficult to master, depending 
on the varactors nonlinear characteristic. 

The classical solution of Fig. 7 is based on the modulation of two quadra- 
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ture LO signals at the carrier frequency, LOi and LOq , by the appropriate 
sinusoidal baseband signals, LFi and LFq. According to the phase relation 
between LFj and LFq (±90°), the output frequency will either be fio — Ilf 
(lower sideband) or /lo + fiF (upper sideband). Note that the LO signals can 
be provided by the same frequency synthesizer circuit, with the same channel 
selection mechanism, as used for the receiver. This solution is very flexible 
and well controlled because it dissociates the generation of the carrier and the 
baseband modulation. Hence, it was selected even though it presents its own 
limitations, which are briefly reminded in the next paragraphs. 

The image signal must be eliminated in order to guarantee a sufficient signal- 
to-spurious ratio at the receiver, after demodulation. The image rejection is 
mostly dependent on the symmetry of the LO and baseband signals and on 
the matching of the upconverter branches. This is summarized by the image 
rejection ratio (IRR) 

IRR = - • (e|o + e\p ± + ^^\f + ^Lxer) (3) 

where clo. and A(fLF the phase and relative amplitude mismatches 

contributions of the LO and baseband signals, respectively, emixer is the mixer 
gain mismatch, and the mixer phase error is assumed to be negligible. A 30 dB 
rejection is largely sufficient to avoid the deterioration of the bit error rate (BER) 
of the transmission. Hence, if the contributions of each term are assumed equal, 
the standard deviation of any of the five mismatches should be kept smaller than 
3%. 

It is important that the frequency keying keeps the baseband signal phase 
continuous in order to limit the emitted signal bandwidth. This condition is 
naturally met by direct modulation solutions, but, for upconversion systems, is 
true only if the baseband signals are themselves continuous in phase. 

The spectral purity of the baseband generator is another concern. Actually, 
every spurious harmonic is directly frequency shifted around the carrier and 
spreads the emitted spectrum, eventually corrupting adjacent channels. 

Since, on one hand, the frequencies of the LO and of the emitted signal are 
only slightly different, and, on the other hand, their power level is very different, 
any coupling between them may unlock the PLL. This coupling will most likely 
arise out-of-chip, on the board. It can be partly avoided by a careful design and 
by providing a relatively high LO swing, but it still limits to a few dBm the 
output power that can be practically expected. However, the maximum emitted 
power is anyway limited to this range both by the ISM bands regulations and by 
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the power which is available from the battery, so that this kind of architecture is 
nevertheless appropriate. 

4.5 Frequency Synthesis 

All the architecture discussed above require a frequency synthesizer. Different 
options for realizing this important building block are discussed below. 

Direct synthesis can be used if a stable and precise resonator is available 
at the required frequency, the local oscillator can then be directly built around 
this component, without the need for any further frequency control mechcmism. 
However, this technique cannot be used for frequencies above a few tens of 
MHz (except eventually using a higher overtone). Up to 1 GHz a SAW resonator 
can be used instead of a quartz. But the current consumption can then easily 
reach several hundreds of yuA. Also, SAW resonators typically have a lower 
frequency accuracy than quartz crystals, easily reaching ±150 ppm and hence 
require some compensation technique. 

In order to avoid the use of relatively expensive high frequency SAW res- 
onators, it would be possible to use a classical quartz oscillator and generate the 
desired frequency by a nonlinear circuit. In practice this technique was almost 
exclusively used at very high frequency of several GHz. Not only must the fre- 
quency multiplier drive its own output load and the load of the synthesizer at the 
LO frequency, but the nonlinear conversion gain is always fairly low. Therefore 
this technique requires very powerful input signals to produce a sufficient level 
of harmonics and additional amplifier stages are required before and after the 
multiplier circuit. Such circuits are therefore not well suited for low-power and 
in addition they are relatively difficult to design in order to guarantee proper 
operation. 

Finally, a standard PLL solution with a frequency divider is usually preferred 
versus a frequency multiplication scheme. It usually requires less power since 
all nodes except the LO and the input of the frequency divider run at a frequency 
lower than the LO itself. The frequency of a low stability high-frequency os- 
cillator is stabilized and locked to a high-stability low-frequency reference by 
means of a frequency divider, a phase comparator and a proper feedback loop. 
The main interest of this structure is that the output frequency can be changed 
by modifying the divider ratio. The low-frequency reference is almost always 
a quartz crystal, which presents a good stability at a low cost. In practice the 
initial frequency accuracy of a quartz operating at a few MHz can easily reach 
±20 ppm, to which a typical aging of about ±3 ppm/year and a temperature 




108 




synthesizer. 

Figure 8: First generation WiseNET transceiver architecture. 

variation of ±15 ppm between 0 and 15°C must be added. No trimming will be 
required if the system can bear a ±27 ppm frequency error. 

The above general considerations have been applied to the design of an ul- 
tra low-power and low-voltage transceiver implemented in a standard digital 
0.5 ij,m CMOS technology. This l®*-generation WiseNET transceiver is briefly 
presented in the next Section. 

5 The First Generation of WiseNET Transceiver 

5.1 The Transceiver Architecture 

The architecture of the 1®*^ -generation WiseNET transceiver is shown in Fig. 8(a). 
It includes a direct conversion receiver, an upconversion transmitter and a PLL- 
based frequency synthesizer that is common to the receiver and the transmitter. 
This l®*^-generation WiseNET transceiver is presented in details in [23,24, 15,25] 
and will only be briefly presented here. 

5.2 The Receiver 

The direct conversion receiver architecture is presented in Fig. 8(b) and was 
chosen mainly for its high integration capability and also in order to minimi ze 
the number of internal nodes operating at RF. As mentioned above, the residual 
offset and high flicker noise drawbacks inherent to direct conversion have been 
circumvented by choosing a wide frequency shift of 100 kHz for a 24 kb/s 
data rate allowing for high-pass filtering the downconverted signal. The front- 
end, composed of the LNA and the two quadrature downconversion mixers. 
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Figure 9: Block schematic of the transmitter section. The frequency synthesizer signals, LOi 
and LOq , are provided by the synthesizer circuit which is shared with the receiver. 

amplifies the signal and transposes it around dc, where the bandpass channel 
filtering is performed. Because active filters with a high dynamic range are very 
difficult to conciliate with low power consumption and supply voltage, passive 
first-order RC filters were connected directly to the mixer outputs in order to 
improve the interferer rejection at large frequency offsets from the carrier. The 
additional pole relieves the stress on the active filters, effectively making their 
saturation less likely. Finally, logarithmic limiters compress the dynamic range 
of the filtered signal and ensure a sufficient level at the demodulator input. 



5.3 The Transmitter 

The transmitter block diagram is shown in Fig. 9. A low-frequency, quadrature 
oscillator LO-BF operates at the FSK frequency offset A/, and the sign of the 
90°phase shift between its I and Q signals can be controlled by the binary data 
signal. This action corresponds to an instantaneous swap of the frequency sign 
of the complex signal I + jQ. This baseband signal is upconverted around the 
carrier frequency by a quadrature upconverter driven by the frequency synthe- 
sizer, which is common to the receiver and will be presented in the next Section. 
This kind of “direct upconversion” avoids any filtering of the undesired image. 
It might present other problems since the PA and the LO operate around the 
same frequency as mentioned in Section 4.4. The impedance level at the output 
of the upconversion mixers is still very high, since the mixer transistors must not 
be be too wide in order to avoid excessively loading the low-power oscillators. 
Therefore, the modulated RF signal goes first through a series of preamplifier 
stages presenting lower and lower output impedance levels, until the very large 
devices of the PA can be driven. The amplitude of the signal fed in the matching 
network is measured by a kind of rectifier and is used to control the actual bias 
of the preamplifier stages in order to ensure a controlled output power level. The 
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Figure 10: Structure of the frequency divider. 



reference of this control loop can be changed to vary the output power between 
0 and 10 dBm, in order to be able to adapt to the power level required by the 
transceiver environment and therefore save power consumption. 



5.4 The Frequency Synthesizer 

The frequency synthesizer structure is presented in Fig. 8(b). It includes a 
VCO coupled to a phase shifter, in order to produce quadrature signals. The 
quartz reference frequency fref was chosen sufficiently high to simplify the 
PLL design, without significantly adding to the power consumption of the sys- 
tem. Therefore, the total divider ratio was set to 128, with fref = 3.39 MHz 
for 434 MHz operation. The image-reject mixer placed in the frequency divi- 
sion chain, depicted in Fig. 10, allows for channel selection by finely tuning the 
synthesized frequency. It shifts the input frequency of the divide-by-A ^2 block, 
relatively to the output of the first prescaler, so that the LO frequency fio be- 
comes 

/LO = /re/- (iV2±^) -iVl. (4) 

The principle of channel selection described here has the advantage of having 
simple circuits operating at high frequency, at the cost of the generation of ana- 
log low-frequency quadrature signals. This task is dedicated to the divide-by-M 
block which is, in fact, some kind of very simple direct digital frequency synthe- 
sis (DDFS) system. To keep the low-power goal, such a solution is practically 
limited to relatively large frequency steps (> 50 kHz), because any spurious in 
the analog mixing signals can only be eliminated by the PLL loop filter. For 
this design, Ni was equal to 4, N2 = 32, and factor M was selectable among 
the seven values ±16, ±24, ±48, and 00 , corresponding to frequency offsets of 
k X 282 kHz, with k G [—3; 3]. 
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Figure 11: Receiver measurements. 



5.5 Experimental Results 

The transceiver chip is shown in Fig. 11(a), where the main blocks are high- 
lighted. The total silicon area is about 6 mm^, without the pads. The circuit was 
encapsulated in a standard CQFP80 package. The fairly high number of pins is 
due to the fact that access was given to many internal nodes, making possible 
the separate test of some blocks. Fig. 11(b) summarizes the power consump- 
tion breakdown among the different receiver blocks. The oscillator, the active 
phase shifter, and the frequency divider are clearly the main contributors, about 
20% each. This shows how much the power consumption of direct-conversion 
receivers is dominated by the frequency synthesizer, where most RF nodes are 
located. For this design, only a single additional pair of RF nodes was required 
in the front-end, which is another significant contributor to power consumption 
(10%). The two filters require another 10% to keep their noise at a reasonable 
level. Finally, the limiters, demodulator, dc bias, and ALL circuits consume 
slightly less than 20%. 

All the results presented in this section have been measured with a 1 V supply 
voltage and for a 1 mW total power consumption. Measurements were also 
performed with the help of a discrete input matching network which allowed 
the use of standard 50 generators. The input matching network losses (a few 
decibels) were not deducted from the measured sensitivity. 

The LO amplitude was set to 125 mV, for a phase noise of —112 dBc/Hz at 
500 kHz from the carrier. The VCO tunability depends on the supply voltage. 
For a 1 V operation, a 5% frequency control has been achieved, while it raised 
to 7% with a 1.2 V supply (402..465 MHz). The global tank quality factor is 
about 23, over the whole tuning range. For supply voltages above 1.1 V, the 
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(a) Loss of sensitivity due to a mismatch between 
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quency. 




(b) Rejection of an unmodulated interferer, for a 
signal 3, 6, and 10 dB stronger than the receiver 
sensitivity at 10“^ BER. The rejection is relative 
to the in-band signal power, and is again measured 
at 10-3 BER. 



Figure 12: Receiver measurements. 



phase error of the phase shifter is kept smaller than 4. At 1 V, however, some 
transistors leave saturation a significant fraction of the time, so that the phase 
error goes up to 15°, which is still acceptable. 

The receiver global performances are illustrated by Fig. 12. Its sensitivity at 
a bit error rate (BER) of 1/1000 was measured at -95 dBm for a 24 kb/s pseudo- 
random binary signal. Fig. 12(a) shows that the system can accept a ±80 ppm 
mismatch of the LO frequency relatively to the incoming signal carrier with a 
3 dB sensitivity loss (±50 ppm at 1 dB). Such an effect could be due, for exam- 
ple, to a temperature change or, simply, to the quartz aging. Fig. 12(b) illustrates 
the rejection of an unmodulated out-of-band interferer. The measurement was 
performed by setting the input signal 3, 6, or 10 dB stronger than the minimum 
signal power corresponding to a BER of 10“^. Then, the power of the unwanted 
signal was increased until the BER was again brought back to 10“^. Finally, 
the ratio of the wanted to the unwanted signal power is plotted in Fig. 12(b). A 
52 dB rejection is achieved at a 1 MHz offset. The measured characteristic is 
very similar in the case of an FSK-modulated interferer, at least for frequency 
offsets higher than 300 kHz. 

Fig. 13(a) shows the evolution of the PA maximum output power and global 
efficiency as a function of the supply voltage. Note that the efficiency is nearly 
constant at 40% for supply voltages in excess of 1.2 V, while it is still 17% at 
1 V. The output power varies between 2 and 24 mW when the supply is swept 
between 1 and 2 V. 

Fig. 13(b) shows the measured output spectra around the carrier for a 100 kHz 
frequency deviation, both for a “0” (light gray) and a “1” symbol (black). The 
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Figure 13: Power amplifier measurements. 



thin curve represents the theoretical spectral density of a 20 kb/s pseudorandom 
modulation in the same conditions. This measurement shows that the image 
frequency is rejected by 38 dB, which is well above the required specification. 
The distortions of the baseband signals generate components which are at least 
39 dB smaller than the carrier, decreasing with the frequency offset about the 
same way than the modulated spectrum. Consequently, almost no broadening 
of the real spectrum is expected due to these nonidealities. 

The regeneration WiseNET transceiver described above has really demon- 
strated the feasibility of integrating an ultralow-power RF transceiver in a 0.5 
standard digital CMOS process with 0.6 V threshold voltages. It has also shown 
that it is possible to operate the transceiver from a 1 V power supply in the 
434 MHz European ISM band and consume only 1 mW in receive mode while 
achieving a -95 dBm sensitivity for a data rate of 24 kb/s. These encouraging 
results motivated the design of a new transceiver especially optimized for the 
WiseMAC protocol. It is briefly described in the next Section. 



6 The WiseNET SoC 

Since in a WiseNET network there are many distributed microsensors, it is not 
envisaged to replace or recharge the batteries. The nodes therefore require a 
long autonomy (typically of more than 3 years) and consequently a very small 
average power consumption. In addition, they have to be very small in order to 
fit in all kind of spaces and, because of their high number, they also have to be 
very low-cost. From these basic specifications, it is then advantageous to realize 
them as highly integrated devices using a dedicated integrated circuit (IC) with 
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Figure 14: Generic building blocks of the WiseNET SoC used in low-end sensor nodes, 
a System-on-a-Chip (SoC) approach. 

This perspective led to the design of the WiseNET SoC which basic archi- 
tecture is described in Fig. 14. In addition to the ultra low-power dual-band 
radio transceiver (Rx and Tx in Fig. 14), it also includes a sensor interface with 
a signal conditioner and two analog-to-digital converters (ADCs) (ANA_FE 
in Fig. 14), a digital control unit based on a CoolRISC microcontroller {jj£) 
with on-chip low-leakage memory (i.e. RAM) and several time basis (real-time 
clock,...) and digital interfaces (SPI, I2C, ...) (CTRL in Fig. 14) and finally 
an important power management block (ROW in Fig. 14). Although all these 
blocks contribute to power consumption, the most critical one, as of today, still 
remains the RF transceiver. The 2"'^-generation WiseNET transceiver was de- 
signed roughly along the same guidelines as the P*-generation described above, 
but taking advantage of moving from a 0.5 ^m to a 0.18 /xm standard digital 
CMOS process. In addition to the main features of this new transceiver de- 
scribed in Table 1, the most important design objectives were: 

• to keep the power consumption in receive mode within the mW range 
while extending the operation to a dual-band operation by adding the 868 MHz 
SRD band; 

• to operate from a single 1.5 V AA alkaline battery with a supply voltage as 
low as 0.9 V (corresponding to the battery end-of-life voltage) and achieve 
several years of autonomy; 

• to use a 0.18 /xm standard digital CMOS process (i.e. without any preci- 
sion analog components such as capacitors and resistors nor dedicated RF 
technology options such as substrate isolation); 

• to minimize the number of external components and the cost. 

The dual-band operation combined with the use of several channels allows 
for frequency diversity to solve the difficult problem of rejecting strong nearby 
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Table 1: Main characteristics of the WiseNET radio. 



Operating frequency 


434 MHz (ISM) and 868 MHz (SRD) 


Channels separation 


600 kHz (primary), 200 kHz (secondary) 


Propagation range 


2 km outdoor — ^ 10 m indoor 


Data rate / modulation 


< 100 kb/s with FSK (A/ = 25 kHz) 
< 4 kb/s with OOK 



interferers. The choice of staying in the 434 MHz and 868 MHz bands instead 
of moving to the popular and worldwide available 2.4 GHz band is mainly re- 
lated to power consumption. Indeed, as shown in Fig. 1 1(b), an important part 
of the power consumption (more than 50%) of a receiver is directly related to 
the circuits operating at or close to the RF frequency, such as the frequency 
synthesizer and the RF front-end. Since the power consumption of these blocks 
is approximately proportional to frequency, choosing sub GHz bands allows to 
limit the power consumption in the mW range. Also, it turns out that operating 
at 2.4 GHz with the chosen 0.18 ^m CMOS process would require a supply 
voltage in the range of 1.8 V, which is incompatible with the target 0.9 V mini- 
mum supply voltage. 

Among the key radio parameters identified in Section 2, the power in receive 
mode has been minimized by first choosing a 0.18 ^um CMOS standard digi- 
tal process. This allows for the selected sub-GHz frequency bands to trade the 
high frequency capability of minimum length transistors with lower current con- 
sumption, by biasing the devices even more in the moderate and weak inversion 
regions, including the devices working at RF. Power consumption is obviously 
also strongly conditioned by the selected radio architecture. Although a highly 
integrated solution is targeted, implementing the rejection of strong nearby in- 
terferers fully on-chip would have required a prohibitive dynamic range and 
power consumption due to the low-voltage requirement. This problem was cir- 
cumvented by using an external RF filter such as a SAW device. The chosen 
radio architecture is a superheterodyne with a high-IF followed by a direct con- 
version to dc, similar to the one presented in [26]. This architecture has the 
advantage that a significant gain can be achieved at the IF without prohibitive 
power consumption, while the channel selection is performed around dc. Hav- 
ing sufficient gain at the IF also reduces the SNR degradation due to the strong 
1/f noise present in deep-submicron CMOS processes and strongly affecting 
the analog baseband. Other external devices are a high-Q inductor used for the 
LC-tank in order to achieve the desired low phase noise and a few capacitors 
for the impedance matching to the antenna. Note that a special on-chip varactor 
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had to be designed in the chosen process to operate at such a low voltage while 
still offering a sufficient tuning range and a high quality factor in order not to 
degrade the tank overall Q-factor [27]. Similar to the -generation WiseNET 
transceiver, the 2"'^-generation also offers a digital RSSI, which is used by the 
WiseMAC protocol for preamble sampling and carrier sense activities. 

If all contributors within the receive chain are turned on, the total current 
drain is prohibitive to enable long duration operation on a single battery. When 
analyzing the individual contributions, the current consumption happens to be 
obviously larger for the RF blocks and significantly smaller for the baseband 
blocks. This is because the current is directly related to the frequency of oper- 
ation or the required bandwidth. On the other hand, tum-on times will vary in- 
versely with the frequency of operation, baseband blocks requiring much more 
time to settle than RF blocks. Therefore, significant energy can be saved by 
waking up the lower power baseband blocks before the power consuming RF 
circuits that anyway wake up very fast. The WiseNET chip offers a high degree 
of flexibility in controlling the individual operation of each blocks. A typical 
example of wake-up sequence that could be implemented is: first power-on the 
low-frequency reference clock, then the baseband path (channel filters, limiters, 
RSSI), followed by the frequency synthesizer and eventually the IF amplifiers 
and the RF front-end (LNA and down-conversion mixers). In addition, the sam- 
pling nature of WiseMAC can be efficiently exploited in the baseband part by 
turning on only the required block. For example, once the receiver chain up 
to the baseband channel filter has been activated, the RSSI can then measure 
the signal strength. If the activity is detected as sufficient by the RSSI, the de- 
modulation blocks may then be turned on for further symbol and frame analysis 
and error correction. In the case the power level is lower than an acceptable 
threshold, either the other channels or bands are probed in accordance with the 
MAC, or the complete receiver is turned off, thus optimizing power burned dur- 
ing both idle listening and receiving. The same power sequencing techniques 
can be used for the trans mi t path. 

The second most important parameter to be optimized is the wake-up time. 
This has been achieved by carefully defining the activation sequence of the dif- 
ferent transceiver blocks. It is then mandatory to use proper circuit design tech- 
niques in order to minimize the wake-up times of the slower blocks. For RF 
blocks, this delay is not an issue because of the larger currents as well as the 
small parasitic capacitances required to achieve RF performance, that will al- 
low very quick settling. Optimization of the setup delay is worth investigating 
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Figure 15: The WiseNET SoC including the dual-band transceiver and the blocks shown in 
Fig. 14 (3x2mm on TSMC 0.18 fim). 

for IF (IF amplification, IF to baseband mixers) and baseband blocks, because 
significant energy can be gained. A clear advantage of deep-submicron CMOS 
is that operation within the order of 100 MHz is feasible with proper biasing 
of the transistors in the signal path in weak or moderate inversion regions. The 
chosen high-IF architecture allows the IF blocks to be operated at frequencies 
where setup time is easy to minimize, without degrading the power consump- 
tion. Concerning baseband blocks, there is an obvious limitation due to the low- 
frequency poles inherent to these blocks. Nonetheless, the settling times have 
been reduced with techniques such as storing the bias points between channel 
sampling, or by boosting the startup-currents momentarily at wake-up, or else 
by switching the internal poles to higher frequencies for fast settling, then set- 
ting them back to the nominal frequency. 

The third key parameters are the receive-to-transmit and transmit-to-receive 
switching times. During these turnaround delays, all the blocks that remain “on” 
will bum power that is not useful for the communication. The architecture of the 
WiseNET SoC has thus been thought with respect to this particular MAC need. 
As an example, the same synthesizer and the same LO signals are used for both 
the receive and the transmit paths. This is made possible because the receiver 
and the transmitter are implemented according to the same high-IF architecture, 
which would probably not be the case if a single-chip transmitter had to be 
implemented separately from a single-chip receiver. In this way, the synthesizer 
remains “on” and is working around the same operating points, yielding fast 
turnaround times. 
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Table 2: Key radio parameters of the 2‘’‘^-generation WiseNET transceiver. 



Power consumption in Rx mode 


1.8 mW 


Power consumption in Tx mode 


31.5 mW 


Wake-up time 


800 fds 


Rx-to-Tx and Tx-to-Rx turn-around time 


400 IIS 



Table 3: Main measured results for the Rx and Tx blocks of the 2"'*-generation WiseNET 
transceiver. 




The resulting parameters are presented in Table 2. These values have been 
used in the simulations presented in Section 3. Some results measured on the 
2"‘^-generation WiseNET transceiver are presented in Table 3. A microphoto- 
graph of the complete WiseNET SoC is shown in Fig. 15. 



7 Conclusions 

A new ultralow-power solution for the implementation of multihop wireless 
sensor networks is presented. Low-power operation is achieved by a co-design 
approach between a new MAC protocol, called WiseMAC and especially devel- 
oped for wireless sensor networks, and a radio optimized for the required duty 
cycle operation. WiseMAC is a single channel contention protocol that requires 
neither set-up signaling nor network-wide synchronization. A high energy ef- 
ficiency in both low and high traffic conditions is obtained by the combination 
of preamble sampling and wake-up preamble length minimization. Although 
initially designed for ad-hoc networks, WiseMAC also provides low-power op- 
eration in infrastructure and mixed networks. 

The feasibility of integrating ultralow-power transceivers in a standard digi- 
tal 0.5 (Um CMOS process with 0.6 V threshold voltages was demonstrated with 
the L‘-generation WiseNET transceiver. The main objective of this first de- 
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sign was to demonstrate that a power consumption of 1 mW from a 1 V power 
supply was achievable in the 434 MHz European ISM band, with a —95 dBm 
sensitivity for a data rate of 24 kb/s. These encouraging results led to the design 
of the -generation WiseNET transceiver, specifically optimized for the new 
WiseMAC protocol. 

Among the many radio parameters that impact the energy efficiency of the 
system, the power consumption in receive mode, the wake-up and the turn- 
around times and current consumption in sleep mode were carefully optimized. 
In order to solve the difficult problem of rejecting strong nearby interferers, 
some frequency diversity was added in the 2"'^ -generation WiseNET transceiver 
making it a dual-band transceiver operating in the European 434 MHz and 
868 MHz bands. It is powered from a single alkaline 1.5 V battery and can 
operate down to 0.9 V. It consumes 1.8 mW (1.8 mA at 1 V) in receive mode, 
with a sensitivity of —104 dBm for a BER=10“^ and a 25 kb/s bit rate. The sleep 
current is as low as 3.5 /j,A. This radio is part of a complete SoC integrated in a 
0.18 jj,m standard digital CMOS process and including a sensor interface with 
two ADCs, a CoolRISC 8-bit /j,C with a low-leakage memory, several time basis 
and a elaborated power management unit. 

Using the 2“‘^-generation WiseNET transceiver with the WiseMAC protocol, 
a relay sensor node consumes about 25 fj,W when forwarding packets of 56 
bytes every 100 seconds. The radio transceiver used on motes [28] requires 
24 mW in receive mode (8 mA at 3 V). If S-MAC or T-MAC is used with 
such a transceiver at a duty cycle of 10%, a minimum power consumption of 
2.4 mW is obtained, which is about 100 times more than what is achieved with 
the presented WiseNET solution. 
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Abstract 

Sensor nodes comprising a power source and means for 
communication significantly reduce the barrier to sensor 
deployment and enable a vast array of new applications. Power 
represents the most significant constraint in many applications. 
We discuss available power sources and the power dissipation of 
circuits and optical and RE communication. A solar powered node 
with passive optical communication has a volume of just 16mm^ 
and measures ambient light and acceleration. A larger unit 
fabricated from off-the-shelf parts and RF communication 
implements a self-configuring network for communicating sensor 
results. 



1. Introduction 

MEMS and related sensor technologies have significantly expanded the range of 
inexpensive and easy to use sensors. But while the cost of sensors has come 
down, deployment is still a major barrier in many applications. In many uses, 
such as building automation or automotive sensing, the cost for power delivery, 
data cabling, and the necessary control structures exceeds that of the sensors by 
orders-of-magnitude. 

A solution that adds computation, communication, and a power source to the 
sensor has the potential to overcome this problem. In this article we explore 
existing and emerging solutions for autonomous sensors, including networks of 
such sensors. We explore the limits on packaging density for an autonomous 
sensor with integrated power source, computation, and communication. 

Scaling trends in microelectronics are driving the reduction in cost, size, and 
power dissipation of computation, communication, and sensing. Extrapolating 
these trends we envision sensors to become increasingly ubiquitous. Self 
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powered and arranged in self-organizing networks they will vastly increase our 
ability to gather information about our environment. Possible outcomes include 
sensor networks reporting real time to freeway traffic conditions or guide us to a 
vacant parking place. At the office, a sensor network adjusts the climate to our 
individual preferences all while conserving energy in our absence. Sensors help 
us find a free conference room, lost glasses, and sound appropriate alarms if our 
briefcase leaves without us. 



2. Smart Sensors 

Temperature, magnetic field, and light / imaging sensors have long exploited the 
benefits of integrated circuit technology. Advantages include efficient high 
volume means of production, small size, low power dissipation, and the ability 
to easily include additional functions such as signal conditioning, temperature 
compensation, or self-calibration. The electrical interface and IC compatible 
packaging simplify system integration and reduce size, cost and power 
dissipation. 




Fig. 1. Integrated 3 -axis acceleration sensor with on-chip signal conditioning 

and A/D conversion. 
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The development of MEMS and related technologies over the last two decades 
extended these benefits to many new types of physical, chemical, and 
biochemical sensors. For example, the integrated accelerometer shown in Figure 
1 includes a three axis sensing element for detecting acceleration in three 
orthogonal directions [1]. The mechanical element is surrounded with electronic 
circuits for signal conditioning and analog-digital conversion. Additional 
functions such as self test, temperature compensation, error correction, and 
factory or user calibration can be added as needed. The result is a packaged 
component that differs little from other electronic devices and can easily be 
integrated into a larger system. This is in stark contrast with conventional 
electromechanical sensors fabricated with precision machining techniques. 
Those usually require special handling, calibration, and mounting, and typically 
depend on external signal conditioning. Similar integrated platforms are 
available for sensing pressure [2], angular rate [3], humidity [4], and strain, just 
to name a few. 

These devices are ideal for use e.g. in machinery, washing machines, or 
automobiles. However, widespread deployment for building and environmental 
sensing is still limited by the high cost of sensor installation. In many cases, 
wiring costs exceed those of the sensors by orders-of-magnitude, thus still 
severely restricting the range of applications where sensors can practically be 
deployed. 



3. Sensor Motes 

Sensor motes attempt to overcome the high cost of sensor deployment by 
organizing sensors in an inexpensive, autonomous package. In addition to the 
sensors, each mote contains a power source, wireless communications link 
(usually an RE transceiver), and small microcontroller. Figure 2 shows an early 
prototype of a mote developed at Berkeley that is capable of monitoring 
temperature, atmospheric pressure, humidity, light intensity, acceleration, and 
magnetic field. The mote also includes a simple microcontroller, short range 
radio transceiver, and battery for up to two years autonomous operation [5]. 
Enhanced versions of such motes are being designed by many researchers and 
also available commercially [6,7,8]. 

Depending on design and power source, motes are capable of autonomous 
operation for several years. Spontaneous peer-to-peer networking enables 
dynamic addition and deletion of motes without global changes to the sensing 
infrastructure [9]. Constrained by size and power consumption limits, the range 
of reliable communication of individual motes is less than 20m in indoor 
environments. Networking overcomes this limitation with motes acting as 
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bridges between peers. Since many sensor applications require only a low traffic 
volume, this solution is usually acceptable. 




Fig. 2. Prototype sensor mote with battery power, RF transceiver, 
microcontroller, and off-the-shelf sensors for six different measurands. 



Figure 3 illustrates the benefits of wireless sensor motes in a laboratory 
experiment where a building design is tested for its ability to withstand 
earthquakes. Compared to the traditional solution with wired sensors taking 
days to install, the motes can be installed in hours. In larger projects or projects 
that evolve over time, the benefits are even more significant. 



4. Sensor Mote Design Considerations 

The concept of a sensor mote hinges on several assumptions. Of these, the need 
to provide power for months or years of operation is one of the most challenging 
and permeates the design of all components of the mote. This section 
summarizes the most likely candidates for power sources in sensor motes 
followed by an analysis of the power requirements of sensors, interface circuits, 
and wireless communication devices. Each subsection also summarizes potential 
future developments and limits. 
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Fig. 3. Wireless sensor motes on a building structure on a shaker test table. 
The conventional solution (right) requires a rack full of electronics and miles of 

wires [17]. 



4. 1 . Energy Sources 

Many sensor applications demand reliable and unattended operation over 
periods of months to several years. In these applications, many of the most 
salient metrics such as sensor fidelity and bandwidth depend substantially on 
the available power source. 

Batteries are the most evident and common solution when connection to the 
power grid is not an option. For sensor applications, energy per weight or 
volume along with shelf life is the most important battery characteristic. 
Presently, non-rechargeable Lithium batteries are available with densities of up 
to 700\^kg or about 4J/mm^ [10], equivalent to a draw during one month of 
lOpW/gram of battery. New battery technologies such as Aluminum-air that are 
currently under development promise energy densities up to 8000Wh/kg. 
Recent progress in battery development resulted in a doubling of the energy 
density every six years. By comparison, microprocessor performance has been 
doubling every 1 .5 years over the last two decades. 

Hydrocarbon fuels are unlikely to change this outlook significantly. For 
example, the upper heating value assuming perfect combustion of gasoline is 
12,700Wh/kg (~40J/mm^). . Unfortunately, the inefficiency and size of the 
power converter reduce the available electrical energy considerably. Both micro 
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engines and fuel cells are presently the object of significant academic and 
commercial interest. Hydrogen (50kWh/kg) offers much higher density per 
weight that however is difficult to exploit in situations where volume is 
constrained also. More exotic solutions, such as radioactive power sources, also 
suffer from often insurmountable practical problems. 

Since most present sensors have higher power requirements, the limited energy 
density of available batteries underlines the importance of a dramatic reduction 
of power dissipation through aggressive use of measures such as duty cycling, 
improved transduction concepts, and low power interface circuits. Some 
advantages can also be gained in sensor networks by relying not on the result of 
a single “golden” sensor, but instead using the outputs from several transducers 
to obtain a more accurate overall reading. 

Scavenging energy from the environment removes battery life time as a 
constraint. Solar cell, thermopiles and mechanical vibration are among the most 
popular approaches. Of these, light is particularly attractive since it is relatively 
easy to convert to electrical energy. Full sun light delivers ImW/mm^ or 
IJ/day/mm^. The power of bright indoor illumination is orders-of-magnitude 
smaller, typically around ~lpW/mm^. The best solar cells achieve up to 30% 
conversion efficiency, although 10% is more typical for uncooled silicon cells. 
Especially for indoor applications relatively large panels are required even for 
low power sensors. These solutions also usually feature a rechargeable backup 
power source to ensure around the clock operation. 

The availability of mechanical vibration as a power source depends strongly on 
the application. Vibration spectra of ventilation ducts, windows, or industrial 
motors suggest harvestable energy on the order of lOmW/g mass of the 
converter [11, 12]. Since the density of silicon is only 2mg/mm^, this source has 
limited application in a micro-sensor setup. 



4.2. Signal Conditioning Circuits 

Electronic circuits are responsible for evaluating sensor outputs, data 
conversion, and control. Power consumption is the major factor constraining the 
design. In scaled CMOS the power dissipation of digital circuits is small even 
considering the limitations of available sources. For example, a recently 
published 8-bit controller dissipates just 12pJ per operation [13]. Assuming a 
lOkHz average execution frequency, this corresponds to three month of 
continuous operation per Joule. Many sensing applications are amenable to duty 
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cycling, increasing the potential operation time from a minuscule battery to 
years. 



"Number of logic gates with same energy consumption as a state-of-the-art B-bit ADC” 




ADC Resolution [B] 

Fig. 4. Analog versus digital energy consumption. 

Analog circuits are constrained by electronic noise, which is not or only weakly 
correlated with minimum feature size. Figure 4 compares the power dissipation 
of logic gates to that of a thermal noise limited analog-to-digital converter 
assuming same operating speed. Accordingly, the power dissipation of a 10-bit 
converter fabricated in a 0.13pm CMOS process is comparable to the 
dissipation of 500,000 logic gates. 

This strongly suggests new converter architectures that can take advantage of 
peculiarities of sensor outputs to reduce power dissipation. For example, we 
have designed an 8-bit successive approximation ADC with an energy 
consumption of just 31pJ per sample [14], compatible with the limitations of 
typical power sources. In many sensor applications, the signal changes slowly 
or is only of interest if it exceeds a set threshold. Rather than operating 
autonomously, the converter is controlled by a microcontroller [13] that can 
take advantage of such prior knowledge to further reduce energy consumption. 

The power dissipation of the sensor interface can be much higher and dominate 
that used by conversion and computation. Just as in digital systems, the power 
dissipation of analog circuits can be reduced significantly with power-down 
modes. The output from many sensors is very small and must be amplified and 
filtered before conversion. This tradeoff is very steep. For example, the energy 
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required to sense a ImV change is on the order of IpJ, while for a IpV 
resolution the consumption increases to 1 pJ per sample, significantly more than 
that needed for computation [15]. 



4.3. Communication 

Sensor networks require some form of communication. Both RF and optical 
solutions are being investigated. Tradeoffs include power dissipation, distance, 
and antenna size. 

For example, a GSM receiver uses approximately 2pJ per bit received and 
around 40pJ for transmission with a range up to several km. Short haul 
solutions such as Bluetooth have significantly lower power requirements, with 
about 0.1 pJ per bit and a 10 ... 100m communication range for low and high- 
power implementations, respectively. Active power dissipation is approximately 
lOOmW. The Bluetooth standard defines several power-down modes to reduce 
power dissipation by turning the receiver on only periodically to listen for 
transmissions on a predefined schedule. Many sensor networks require the 
ability to give rapid notification of alarm conditions that can occur at any time. 
Such constraints can increase standby power signifieantly. 

RF signals undergo attenuation proportional to the communication distance d 
raised to a power that depends on the environment. The attenuation of outdoor 
line-of-sight transmissions is proportional to cf. Indoors, the attenuation 
depends on partitions and building materials and is typically around d^. Because 
of this strong attenuation, it is advantageous to break long communication path 
into several shorter segments. For example, a link that consumes lOOnJ/bit to 
communicate over a distance of 50m takes only 5 x lOOnJ/5'^ = 0.8nJ/bit if the 
total distance is broken into five segments of 10m each, a 2^==125 fold 
reduction. The fixed overhead reduces the actually achieved savings, multi-hop 
schemes increase latency and can reduce the throughput of the network. Still, 
the benefits of dense sensor networks enabling short range communication are 
very significant. 

When applicable, optical communication has several advantages for 
communication from small devices with limited power. First, optical radiators, 
such as laser diodes, are small and have large antenna gain compared to the 
much larger antennas needed for microwave communication. Secondly, media 
access control can be implemented using spatial division multiplexing, which 
just relies on devices located in different positions. This is a simpler and lower 
energy technique than those used for RF communication, e.g. frequency, time. 
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or code division multiplexing. Typical figures for optical communication are 
20pJ/bit for distances up to 100m and lOnJ per bit for up to 10km [18,19]. 



4. Future Autonomous Sensor Motes 

Cost and size are very important aspects of sensor motes. The vision of 
ubiquitous sensing depends on dense networks of unobtrusive components. 
Figure 5 shows a mote fabricated from off-the-shelf components comprising a 
lithium battery, microcontroller, several sensors, and a simple RF link. The 
resident software automatically establishes communication with neighboring 
nodes. A message forwarding mechanism passes results over larger distances 
exceeding the capability of the radio. By rerouting messages in case of node 
failure, the message passing algorithm also provides fault tolerance. 




Fig. 5. RF sensor nodes. 



Custom integration promises much smaller size and cost. Figure 6 shows an 
attempt of a custom sensor node. For practical reasons, the device is realized on 
several substrates, but could conceivably be integrated on a single die. The 
device includes a solar cell for powering, micromachined comer cube reflector 
(CCR) for passive optical communication, ambient light sensor, accelerometer, 
A/D converter, 3MHz oscillator, and digital control circuits. The entire solution 
comprises a volume of only 16mm^. The CCR is capable of communication 
over distances up to 180m and consumes just 16pJ/bit on average [16,18]. 
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While substantial effort is required on advanced packaging, improved 
communication protocol, power management, this result demonstrates the 
opportunities for future autonomous sensor motes. 




Fig. 6. Solar powered autonomous sensor node. 
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PART II: INTEGRATED HIGH-VOLTAGE 
ELECTRONICS AND POWER MANAGEMENT 

Michiel Steyaert 

The trends towards deep submicron technologies results in ever decreasing 
power supply voltages. Moreover, the power drain of high performance circuits 
becomes very important for battery powered systems. For that two important 
issues are discussed: High voltage techniques and devices in standard 
technologies and power management techniques. 

The first three papers are addressing the High-Voltage issues. The first paper 
describes the problems based on automotive requirements. Due to the high 
temperature requirements, the required EMC and over voltage protection 
requirements, the design of those circuits have to be performed very carefully. 
Examples such as band gap references and overload protection circuits are used 
as example to demonstrate and to improve design techniques. 

The second paper focuses the High-Voltage issue on class-D audio power 
amplifiers. Due to the combination of special technologies, careful design and 
using circuit techniques to accurate control the ’zero switching dead time’ it is 
demonstrated that very low distortions in combination with high efficiencies 
can be achieved. 

The third paper handles the RF applications. Also in those applications the lower 
voltages available in deep submicron results in difficult power amplifier designs. 
By developing DMOS structures in standard CMOS technologies, higher 
voltages can be achieved. However it is not clear at all if class E topologies are 
used that the gain due to higher voltages will result in higher power amplifier 
efficiencies due to the higher input capacitances of the DMOS topologies. 

The next three papers deal with power management. The fourth paper discusses 
power management issues for CPU’s for desktop and mobile systems. Due to the 
large (up to 14 A) load changes and required protection circuits, special power 
converter topologies are analyzed and discussed 

The fifth paper addresses power management issues for RF applications. Due to 
single battery applications, the investigation and trade-offs of different building 
blocks direct battery operated are discussed. Timing and PSRR issues are of 
course as a result very important. By using those techniques, very low power 
RF systems can be realized. 
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Finally, the last paper discusses the optimization of the total power conversion 
chain. This includes from batteries, external components down to the IC 
regulators. Special battery models are generated to optimize the whole chain 
towards performances, size and cost. 




AUTOMOTIVE IC-DESIGN 



Heinz Zitta, Wolfgang Horn, Christian Lenzhofer 
Infineon Technologies 
Villach, Austria 



Abstract 

The design of integrated circuits for automotive applications has 
to face multiple challenges, such as high voltage capability, 
robustness in a wide temperature range, good electromagnetic 
compability, protection against short-circuit and overvoltage 
conditions, and predictable behaviour also in case of missing 
supply voltages. 

This paper presents some analog circuit examples to deal with the 
mentioned circumstances without loosing functional performance. 



1. Introduction 

Today’s suppliers of automotive electronics prefer a system-on-a-chip approach 
to achieve the ever increasing reliability requirements. Smart Power 
technologies make it possible to combine logic, analog functions and multiple 
power output stages on one chip. Special design efforts are necessary for the 
analog circuits to obtain function and accuracy under tough conditions, caused 
by the high power dissipation, which directly implies very high chip 
temperatures. Other than wireless, consumer or desktop applications, automotive 
electronics have to withstand operating temperatures in the range of -40°C to 
175°C and even more for a short time in short-current or broken-load conditions. 
Another challenge is the handling of disturbant electric pulses occuring ran- 
domly during operation. These pulses are caused by the switching of the various 
actuators distributed along the electric power net of the car or by the actuators 
themselves. They must not lead to malfunction or destruction of the integrated 
circuit, but they are often the root cause for activating parasitics. 

2. Technology 

Integrated circuits for automotive applications require power technologies which 
can withstand 60V, can handle high currents of up to 50A and are reliable over a 
wide temperature range. 

Two kinds of technologies are used to fulfill these requirements: 
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2.1 Self-isolation Smart Technology with vertical current flow 

This kind of power semiconductor technology is based on the CMOS self- 
isolation approach. To minimize Rds(on) for a given chip area, the n-channel 
DMOS power device is vertical, using the backside metallization of the chip as a 
drain connection. This enables very low on-state resistances in the milli-Ohm 
range and consequently is very well suited for switching high currents in 
automotive applications. As shown in Fig. 2.1 the drain connection is equal to 
the substrate of the chip and, therefore, always has to be connected to the 
positive supply voltage. This leads to the restriction that multi-channel switches 
are only possible in common-drain (high-side) configuration. Analog and logic 
circuits can be integrated on the same chip by using the low-voltage CMOS and 
the high-voltage lateral MOS devices. 




Fig. 2.1: Cross section of vertical Smart Technology 



2.2 Junction isolated Smart Power Technology (SPT) 

The junction-isolation approach enables the integration of CMOS, bipolar and 
power-DMOS on one chip, therefore, this type of technology is known as BCD 
(bipolar-CMOS-DMOS) [1,7]. The cross- section of some devices is shown in 
Fig. 2.2. Each device is placed within an n-well terminated by the highly n- 
doped buried layer and n-sinker. The isolation among the n-wells by p^-areas 
allows for the implementation of several devices working independently on a 
chip. Therefore this technology allows to integrate a system solution on one chip 
with multifold low-side or high-side outputs or even power stages in bridge 
configuration including all circuits for analog diagnostics and protection and 
logic circuits. 
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Fig. 2.2: Cross section of some devices of an SPT technology 



3. Robust design of a bandgap reference circuit 

As mentioned before, in BCD semiconductor technologies the devices sharing 
one substrate are isolated from each other by reverse-biased p-n-junctions. 
Reverse bias is usually established by connecting the p-doped substrate to the 
most negative supply voltage, which often is equal to ground. Under normal 
operating conditions, the parasitic interaction among the devices via the 
substrate is minimized this way. There are, however, situations where the 
isolation is degraded or partly lost. Two examples are very high operating 
temperatures (>150°C) and the presence of minority carriers in the substrate. In 
the automotive environment, both occur frequently and consequent degradation 
of performance or loss of functionality have to be avoided by a robust design. 

In this section, the implications of very high operating temperatures and 
minority-carrier injection are discussed. Possible ways to deal with the issue are 
presented and a robust bandgap reference circuit is shown as a design example. 

3.1. High Temperature Operation 

The automotive operating temperature ranges from -40°C to 150°C. At these 
temperatures, junction leakage currents are small compared to the operating bias 
currents and usually do not limit the overall performance. There are, however, 
requirements for an extended temperature range of up to 175°C during operation 
and more than 200°C during fault conditions such as overload (short circuit). 
Junction leakage currents rise exponentially with temperature and can, at 
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T>150°C, exceed the operating bias currents by far. The leakage currents can be 
expressed as 



h=<iA 



\ D\ 



+ - 



Lp No L„ N, 



2 , . W 2 

n. +qA — rij 



( 1 ) 



with q being the electron charge, A the junction area, Dp and D„ the diffusion 
constants, Lp and Ln the diffusion lengths, Nd and Na the doping levels, ni the 
intrinsic carrier density, w the junction width, and To the effective lifetime of the 
minority carriers in the depletion region. 




Fig. 3.1: Brokaw cell with substrate junctions (red circles) 

From equation (1), it is obvious that leakage currents are proportional to the 
junction area A and that higher doping levels lead to lower leakage currents. 
Therefore, the junctions isolating the devices from the substrate are the ones 
with the highest leakage currents due to their large area and to the low doping 
level of the substrate [3]. This can be exploited in circuit design by avoiding 
substrate junctions in the critical signal path and biasing those junctions from 
low-impedance sources. As shown in the example in Fig. 3.1, some commonly 
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used circuits do not follow this rule and therefore in high-temperature design 
new circuit topologies have to be used [9]. The red circles in Fig. 3.1 mark 
nodes associated with n-epi wells which are driven from high-impedance 
sources. Clearly parasitic currents at those nodes will degrade the circuit’s 
performance. A measurement result of the reference voltage obtained from a 
circuit with the topology of Fig. 3.1 is shown in Fig. 3.2. At temperatures above 
approximately 150°C, a deviation from the expected order characteristic can 
be observed and at T>180°C, the influence of leakage currents gets dominant 
and leads to malfunction above 200°C. 




Fig. 3.2: Standard bandgap reference circuit performance and 

total current consumption 



3.2. Minority Carrier Injection 

The presence of minority carriers in the substrate is a well known problem in 
BCD semiconductor technologies and a major cause for redesign [7, 8]. 
Minority carriers, i.e. electrons when a p-type substrate is used, are injected into 
the substrate when the n-epi well of a device gets forward-biased in respect to 
the substrate. Subsequently, these electrons diffuse over large distances (in the 
order of the die size) and are partially swept across reverse-biased epi-substrate 
junctions of other devices in the same substrate. There, they impose parasitic 
currents which are highly dependent on doping levels, temperature, amount of 
injection, and layout and therefore are hard to predict. 

The nodes where parasitic currents caused by minority carrier injection have to 
be expected are the n-epi wells of each device. These are exactly the same nodes 
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where the largest high-temperature junction leakage currents will be observed 
(see section 3.1). 

From this, it can be concluded that a circuit topology which is designed for high- 
temperature operation will also be inherently robust against parasitic currents 
caused by minority carriers in the substrate and vice versa. 




Fig. 3.3: Robust bandgap reference - simplified schematic 



Fig. 3.3 shows an example of a bandgap reference circuit which has been 
designed in order to avoid parasitic currents in the control loop. This is achieved 
mainly by supplying all nodes associated with n-epi wells (green circles in 
Fig. 3.3) from low-impedance sources such as the voltage supplies. This is in 
accordance with the design rule derived in section 3.1. The circuit uses a series 
connection of two base-emitter junctions of n-p-n transistors which, in turn, are 
connected in series with a dVBE generator. This way, a stable reference voltage 
of 2.4V is achieved according to the bandgap reference principle [10]. A simple 
CMOS opamp is used to close the control loop and to establish equal voltages at 
the gates of Ml and M2. Due to the stacking of two base-emitter junctions, the 
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ratio of R2/R1 can be kept low. Therefore, the offset of the CMOS opamp has 
less influence on the output voltage. 

Measurement results of the robust bandgap reference circuit are presented in 
Fig. 3.4. The scale of the voltage has been chosen so that the upper and lower 
limit in the graph correspond to the average reference voltage ±1%. As can be 
seen, the reference voltage is accurate within a fraction of one percent in a 
temperature range of up to 295°C (which was the limit of the measurement 
equipment). The total current consumption of the chip is also shown to 
emphasize the effect of junction leakage currents. 

Tests have also been carried out to demonstrate the immunity against minority 
carrier injection. A large epi-substrate junction of a power-DMOS transistor has 
been forward-biased to inject a large amount of electrons into the substrate. As 
expected, no influence on the reference voltage could be detected until the 
injected current reached excessive levels and the power-DMOS was destroyed. 




TPC] 



Fig. 3.4: Robust bandgap reference - measurement result 

4. Failsafe warning-lamp driver circuit 

The typical configuration of an automotive system chip with a warning-lamp 
output is shown in Fig. 4.1. It includes power outputs to drive the application 
specific loads and analog diagnostic circuits which are responsible for testing 
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the internal circuits and the external components. All diagnostic information is 
combined and activates an output which drives a lamp. This warning lamp is 
situated in the dashboard of the car. It is switched on for a short time when the 
car is started and then - if everything is okay - it is switched off In case of any 
error in the system, this warning lamp is switched on again and the car driver 
gets the information that, e.g., the airbag or the ABS system do not work 
properly. 




Fig. 4.1: Block diagram of a smart-power system chip 

The definition: “In case of an error the lamp has to be switched on” includes 
also the failure mode that the supply line is disconnected. Loosing the battery 
connection VS of the module should result in an activated lamp. How to 
consider this requirement in designing a driver circuit is described in detail 
below. 

A block diagram of the lamp driver circuit is shown in Fig. 4.2. Under normal 
supply voltage conditions, the load RL is fed by the voltage VBAT, whereas the 
logic and the gate driver circuits are supplied by VS. Now we assume the 
following error condition: The voltage VS is missing, but the voltage VBAT 
exists. In that case, the lamp should be switched on. This would report the error 
condition of the missing VS because all errors are reported by switching on the 
lamp - normally by the logic circuit. But the logic circuit is now missing its 
supply voltage! To achieve the desired function, the gate voltage has to be 
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captured from VBAT via the load. It would be no problem if the DMOS was 
switched off. But the switched-on DMOS does not provide a sufficient voltage 
VD at its drain to supply the circuits. The target is to develop a gate-driver 
circuit which can manage this condition. This is done in two essential steps: 
Firstly, a charge pump is used to multiply the low voltage VD to an adequate 
level. And secondly, a regulation circuit controls the saturation voltage VD in 
such a way that it will not drop below a minimum value which is the required 
operation voltage of the charge pump. Additionally, the design of the overload 
protection circuit has to accept a low-voltage VD as a supply voltage. 




Fig. 4.2: Block diagram of the lamp driver circuit 

A more detailed circuit diagram is shown in Fig. 4.3. The gate voltage of the 
DMOS is supplied either via VS or via VCP, which is the output voltage of the 
charge pump. The input voltage of the charge pump is supplied by VD, the drain 
voltage of the power DMOS. A bias circuit consisting of the devices Q1 to Q4 
generates a stabilized current IB which is mirrored to supply the charge pump 
and the regulation circuit. To keep the bias circuit operating, the voltage VD 
should not drop below two base-emitter voltages (VBE). The control loop is 
described below. The DMOS gate is charged by the current 18 and discharged 
by the current 110. The discharging current is cut off by means of QIO so that 
the DMOS will remain switched on. If the drain voltage VD decreases, a point 
will be reached where the current 12 becomes zero. Then the current 19 and, 
correspondingly, 110 will increase. This leads to a state where 18 is equal to 110 
as a stable operating point. The large gate capacitance of the DMOS acts as a 
sufficient loop compensation of this control circuit. The final potential of VD is 
defined by the voltage drops of Ml 1 and Q9. In practice, this amounts to about 
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2V, which is high enough to feed the bias circuit and the charge pump. On the 
other hand, it is low enough to be accepted as a drain-to-source voltage drop of 



the DMOS. 




Fig. 4.3: Circuit diagram of the lamp driver circuit 

The charge pump circuit is shown in Fig. 4.4. To achieve the required gate 
voltage, a multi-stage design was chosen. The five inverter stages are directly 
connected in a feedback loop to form a ring oscillator, therefore, no additional 
oscillator is necessary. 




Fig. 4.4: Charge pump circuit 
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The overload protection circuit is shown in Fig. 4.5. It provides current limiting 
and temperature protection of the DMOS. The circuit is supplied directly from 
the drain voltage of the output power transistor and does not need any additional 
supply voltage or reference voltage. The bias generator is similar to the one 
shown in Fig. 4.3. 

The current 12 depends on AVBE and R2, where AVBE is given by comparing 
Q1 + Q4 with Q2 + Q3: 

AVBE = Vt •[ In(AreaQl) + ln(AreaQ4) - ln(AreaQ2) - ln(AreaQ3) ], 

Vt = kT/q depends on the absolute temperature T, thus the bias circuit is actually 
a so-called PTAT source. This means that the current is proportional to the 
absolute temperature. Adding the current mirror Q7, Qll and the devices R4, 
Q14 completes the circuit to a temperature shut-off block. The voltage drop 
across R4 is increasing with temperature. Compared to the decreasing VBE 
characteristic of the bipolar device Q14, this results in a well defined 
temperature shut-off point. 

The second task of the circuit is current limitation. The current threshold is 
defined by the AVBE of Q15 and Q16, compared to the voltage drop at R3. The 
shunt resistor R3 is part of the metal interconnect layer. This leads to a first- 
order temperature compensation of the current threshold. 




Fig. 4.5: Overload protection circuit 
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5. Temperature sensor for very high temperature thresholds 

Thermal protection of smart power circuits requires a temperature sensor. The 
temperature threshold has to be low enough to protect the device in case of any 
overload like a short circuit or an overvoltage pulse. On the other hand, the 
temperature threshold should be sufficiently high so that the sensor is never 
activated under any possible operating conditions including highest ambient 
temperature and overvoltage. The required temperature threshold depends on the 
position of the temperature sensor in relation to the hot spot of the power- 
DMOS [2]. It is typically in the range of 175°C to 200°C if the sensor is situated 
at the edge of the power device. But for some applications, as e.g. power ICs for 
activating airbag firing, the worst- case operating conditions can lead to very 
fast heating of the DMOS, which results in hot spot temperatures of above 
250°C. To allow this high temperature for operation and, on the other hand, to 
protect the device against thermal destruction, the temperature sensor should be 
situated close to the hot spot in the center of the power device. This requires 
temperature sensors which can reliably be adjusted in a temperature range of 
250°C and higher. 

For accurate temperature sensing, it is convenient to use a bipolar transistor and 
to compare the characteristic of the base emitter voltage with a constant 
reference voltage, which is generated e.g. from a bandgap reference [6], see Fig. 
5.1. 



VCC 
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Temperature 



Fig. 5.1: Temperature sensor with hysteresis 



The problem of a bipolar device in the very high temperature range is the 
increase of junction leakage currents from collector to substrate and collector to 
base at high temperatures. A measurement of the VBE voltage of an n-p-n 
transistor over temperature is shown in Fig. 5.2. The measured data shows that 
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the linear behavior of the VBE of approximately -2mV/K is valid in a wide 
temperature range up to 250^^0 but drops down in the higher temperature range 
if the bipolar n-p-n is driven by a constant collector current. By forcing a 
constant emitter current to the transistor, it would be possible to extend the 
useful range of the VBE voltage for temperature measurement. 




Fig, 5.2: Measured data of base-emitter voltage vs. temperature 

If a simple temperature sensor circuit as shown in Fig. 5.1 is used in an extended 
temperature range, the leakage current will lead to a wrong threshold for the 
over-temperature detection. Fig. 5.3 shows an improved circuit where the 
leakage current of the bipolar temperature sensor T1 is compensated with the 
help of a second transistor T2 which is deactivated by a shorted BE diode and 
situated close to the sensor to have the same temperature as the sensor and 
subsequently to have the same leakage current. The PMOS current mirror M2, 
M3 compensates this leakage current by feeding the same amount of current into 
the collector of the sensor. An additional current mirror T3,T4 sinks the 
additional current due to high temperature from the base of the sensor. This 
concept has proven to work reliably up to 300°C and allows for extending the 
usable range of bipolar sensors for power switches with very high power 
densities. 
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VCC 




Fig. 5.3: Temperature sensor with Fig. 5.4: Temperature sensor 
full leakage current compensation with collector at supply line t 



Another concept for an improved temperature sensor circuit is shown in Fig. 5.4. 
Here the collector of the sense-transistor is directly connected to a power supply 
line which is able to deliver the leakage current without an additional 
compensation device. Comparison of the base-emitter voltage with the reference 
voltage is performed by an MOS comparator. 



6. Conclusions 

Analog circuits, which are part of smart power circuits for automotive 
applications, have to withstand tough requirements. A selection of circuits 
which deals with the problem of very high temperatures, substrate injection 
problems caused by reverse-current conditions, and a solution for functionality 
without supply voltage are presented. Modem smart power technologies offer all 
analog and digital 

components necessary to integrate system solutions, including the power stages 
on one chip. Robustness of analog design in this surrounding will be one key 
factor for success of extended smart- power system integration in the future. 




151 



7. References 

[1] M.Stecher et al, “Key Features of a Smart Power Technology for 
Automotive Applications”, Conference on Integrated Power Systems 
CIPS 2002, Bremen, Germany 

[2] M.Glavanovics, H.Zitta, “Dynamic Hot Spot Temperature Sensing in 
Smart Power Switches”, ESSCIRC 2002 

[3] W.Hom, H.Zitta, "A Robust Smart Power Bandgap Reference Circuit for 
Use in an Automotive Environment", IEEE Journal of Solid-State Circuits, 
Vol. 37, No. 7, pp. 949-952, July 2002 

[4] M.Glavanovics, H.Zitta, “Thermal Destruction Testing: an Indirect 
Approach to a Simple Dynamic Thermal Model of Smart Power 
Switches”. ESSCIRC 2001, Proc. p.236 ff. 

[5] H.Zitta, H.Schmidt-Habich, M.Stecher, “SPT4/90V - A Voltage 
Derivative of the Smart Power Technology opens the Door to the 42 V 
PowerNet”, Congress on 42V PowerNet, Villach, Austria, Sep 1999 

[6] H.Zitta, “Smart Power Circuits for Power Switches Including Diagnostic 
Functions”, Analog Circuit Design, Kluwer Academic Publisher 1995, 
p.249 ff. 

[7] B.Murari, F.Bertotto, G.A.Vignola (eds.), “Smart Power ICs: 
Technologies and Applications”, Springer 1995 

[8] R. J. Widlar, “Controlling Substrate Currents in Junction-Isolated IC’s”, 
IEEE Journal of Solid-State Circuits, Vol. 26, No. 8, pp. 1090-1097, Aug. 
1991 

[9] A. P. Brokaw, “A Simple Three-Terminal IC Bandgap Reference”, IEEE 
Journal of Solid-State Circuits, Vol. SC-9, No. 6, pp. 388-393, Dec. 1974 

[10] R. J. Widlar, “New Developments in IC Voltage Regulators”, IEEE 
Journal of Solid-State Circuits, Vol. SC-6, No. 1, pp. 2-7, Feb. 1971 




DESIGN OF CLASS-D AUDIO POWER AMPLIFIERS IN SOI 

TECHNOLOGY 

Marco Berkhout 

Philips Semiconductors Nijmegen 



Abstract 

The design of integrated class-D audio power amplifiers is 
discussed. The amplifiers are realized in an SOI based BCD- 
technology that is inherently free from latch-up. The core of a 
class-D amplifier is the switching output stage. Accurate control 
of the switch timing is essential for good audio performance. In 
order to achieve this, detailed knowledge of the transient 
dynamics is necessary. Robustness is the crucial thread 
throughout the design. Robustness is complicated by high 
voltages and the switching of large currents in the output stage. 
Further the output stage needs to be robust against all sorts of 
fault conditions such as short circuits and electro-static discharge. 



1. Introduction 

The operating principles of class-D amplifiers have been known for a long time, 
but only recently semiconductor manufacturers are able to produce reliable 
integrated class-D amplifiers of sufficient quality [1]. The audio performance of 
modem class-D amplifiers equals or even exceeds that of conventional class-AB 
amplifiers. The distinguishing feature of class-D amplifiers is the high 
efficiency which allows for smaller heatsinks or higher output power before 
mnning into thermal limitations. This is especially advantageous in multi- 
channel systems such as DVD receivers. The main drawback of class-D 
amplifiers is that the switching at the output is a source of electro-magnetic 
interference (EMI). A careful optimization of the application is crucial. Also, 
class-D amplifiers require an external lowpass filter that contains at least one 
inductor which adds to the cost. 

2. Operating Principle 

The core of a class-D amplifier is the switching output stage. A block diagram 
is shown in figure 1 . The switching output stage consists of two large switches 
Sh and Sl and a switch control block. The function of the output stage is a 
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simple one: switching the output node Vom up and down between the supply rails 
VssP and VddP- This results in a square wave signal with a frequency that 
generally lies between 200kHz and 800kHz. The audio signal is embedded in 
the output signal using some form of pulse- width modulation (PWM). Many 
forms of PWM exist [2], all having their specific advantages and disadvantages. 
However, a discussion on PWM is beyond the scope of this paper. 




Figure 1. Cl ass -D output stage 

Ultimately both the audio performance and EMI are largely determined by the 
switching output stage. Accurate control of the switch timing is essential to 
reproduce the PWM signal with sufficient fidelity. Usually, the switch control 
includes a break-before-make arrangement that prevents the power switches Sl,h 
from conducting simultaneously. This results in a period of time where control 
of the output voltage is lost which is appropriately called the dead time. During 
the dead time the output current hut flows through one of the fly-back diodes 
Dl,h^ Dead time is one of the dominant sources of distortion in class-D 
amplifiers [3]. 



3. SOI Technology 

Design starts with the selection of an appropriate technology. For class-D audio 
amplifiers a BCD (Bipolar, CMOS, DMOS) processes is an almost inevitable 
choice. The bipolar transistors are indispensable for analog circuits where noise 
and offset are important. The CMOS allows for integration of modest size logic 
circuits for interfacing and control functions. Finally, DMOS transistors are 
almost perfect power switches featuring high breakdown voltage and low on- 
resistance plus the ability to conduct current in both directions. 

A process that is particularly suitable for switching applications is A-BCD [4] 
which is an SOI based BCD technology. The dielectric isolation between the 
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integrated components makes designs in A-BCD inherently free from latch-up 
phenomena. Consequently, the backgate diodes of the DMOS power transistors 
in the switching power stage can be exploited as fly-back diodes without the 
need for external schottky diodes. In conventional bulk technologies external 
diodes are often necessary to prevent injection of minority carriers into the 
substrate which can lead to latch-up. 




BOX 

Wafer 



Figure 2. Cross-section of a 60V N-DMOS in A-BCD 

The A-BCD process includes DMOS transistors with breakdown voltages 
ranging from 12V to 120V. Further, a range of resistors, capacitors and (zener-) 
diodes is available. A cross-section of a 60V DMOS transistor in A-BCD is 
shown in figure 2. 




Figure 3. Class-D output stage with nDMOS power transistors 

A particularly advantageous feature of A-BCD is the remarkably small reverse 
recovery charge associated with the backgate diodes of the DMOS transistors. 
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The reverse recovery charge is almost an order of magnitude smaller than that of 
comparable DMOS transistors in bulk technology. The small reverse recovery 
charge makes A-BCD especially suited for switching applications since reverse 
recovery is one of the major sources of EMI. 

In integrated circuits n-channel DMOS transistors are often preferred over p- 
channel because of their lower Ron*Area product. A switching output stage with 
nDMOS switches is shown in figure 3. A separate low voltage power supply 
Vaux is used for the driver of the lowside power transistor Mi. An external 
bootstrap capacitor Cboot serves as a floating power supply for the driver of the 
highside power transistor Mh. This configuration enables the gate of the 
highside power transistor Mh to be driven with a voltage higher than the power 
supply voltage Vddp- Each time the output Vout switches to the lowside, bootstrap 
capacitor Cboot is (re)charged through bootstrap diode Dboot- In order to guarantee 
that the gate drive of the power transistors is not disturbed by steep voltage 
transients at the output node Vouh a latch is included in the drivers. The latches 
are set and reset by a switch control unit. The highside latch switches up and 
down with the output node This means that a levelshifter is needed to 
enable the communication between the switch logic and the highside latch. The 
levelshifter has to be insensitive to the voltage transients at the output node Vout- 

4. Robustness in High Voltage Circuit Design 

Robustness is a general requirement for any (integrated) circuit. Three domains 
can be distinguished: normal operation, electrostatic discharge (ESD) and fault 
conditions. 



4.1. Normal Operation 

A circuit should be inherently robust under normal operating conditions. 
Although this may seem obvious this is not a trivial matter in high voltage 
electronics. Usually only a small selection of the available components is 
capable of handling the entire supply voltage. Especially the gate-oxides of 
(D)MOS transistors are vulnerable and precautions have to be taken in order to 
protect them from damage. For this reason the driver of the lowside power 
transistor Ml uses a separate low voltage supply Vaux- The voltage swing at the 
output node can be 60V or more while the gate-source voltage of Ml is limited 
to 12V. Since the bootstrap capacitor Cboot is charged from Vaux the gate-source 
voltage of the highside power transistor Mh is automatically also limited to 12 V. 
Cascodes are widely applied throughout high voltage circuits. The 5V NMOS 
transistors tend to have better noise and matching characteristics than the high 
voltage DMOS transistors. Therefore, current mirrors are often realized in 
NMOS with DMOS cascodes as shown in figure 4(a). Another typical circuit 
structure is the cascoded bipolar differential pair shown in figure 4(b). The 
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breakdown voltage of the NPN transistors in A-BCD is limited to about 18V so 
a DMOS cascode is sometimes required to limit the collector voltage. 

Special attention is needed to avoid floating nodes. Consider the circuit shown 
in figure 4(c). Suppose the gate of DMOS transistor Mi is connected to 12V and 
the drain to 60V. As long as the switch is closed, resistor Ri pulls down the 
source of Mi to at least one threshold below the gate voltage. But when the 
switch is opened the source of Mi becomes floating. The leakage current through 
the backgate diode of Mi pulls the source up towards the drain resulting in an 
increasing voltage across the gate-oxide that eventually leads to destruction of 
the device. This is prevented by the clamp diode Di that limits the reverse gate- 
source voltage. Alternatively, a very small standby current can be drawn from 
the source to prevent the source voltage from floating. 




Figure 4. High voltage circuit techniques (a) currentmirror cascode (b) 
differential pair cascode (c) gate-source clamp 

A robustness issue directly related to class-D is caused by the switching of large 
currents at the output. Unfortunately, DMOS transistors have a rather effective 
bipolar parasite that causes the device to snap back to a highly conductive 
bipolar mode when the breakdown current exceeds a critical level. A typical 
breakdown characteristic of a DMOS transistor is shown as the solid line in 
figure 5(a). The hold voltage Vkou usually lies well below the supply voltage. In 
this mode the current capability is very limited and the device can easily be 
destroyed. 

Consider the situation shown in figure 5(b) where the highside power transistor 
Mh is conducting and a substantial current I„ut flows fi'om the positive supply 
Vmp towards the output This current also flows through the series inductor 
Lh that accounts for the parasitic inductance of the bondwires and leadfingers. 
When the highside power transistor Mh is switched off the current through 
inductor Lh decreases rapidly which causes the voltage at the drain of Mh to rise. 
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At the same time the continuing output current lout pulls down the voltage at the 
source of Mh. The resulting drain-source voltage can easily exceed the 
breakdown voltage BVds of Mh- If Mh snaps back this would destroy the output 
stage. A similar situation can happen with the lowside power transistor Ml. 




Figure 5. High current switching (a) breakdown characteristics (b) parasitic 

inductance. 

The voltage overshoot caused by the series inductors can be limited effectively 
using an ESD diode Zesd as shown in figure 5(b) When the supply voltage 
overshoots, the protection triggers and clamps the drain-source voltages of the 
power transistors to a safe value. The dissipation in the diode Zesd during high 
current switching is a few orders of magnitude smaller than that of an ESD 
discharge since the parasitic inductors are very small and so is the amount of 
stored energy. Of course, the ESD diode Zesd also protects the output stage 
against actual ESD zaps. 



4.2. Electrostatic Discharge 

Robustness against ESD is in general achieved by adding dedicated protection 
devices to a circuit. The purpose of these ESD protections is to divert the 
discharge current and prevent it from flowing through the circuit itself Under 
normal operating conditions the ESD protections should be inactive. The 
breakdown behaviour of Zesd in figure 5(b) is shown as the dashed line in figure 
5(a). The reverse breakdown behavior of the device is characterized by the 
following parameters: 
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BVesd 

Breakdown voltage. Below this voltage the protection does not conduct. 

Trigger voltage/current for bipolar tum-on. Above this current the 
protection snaps back to a state of high conductivity. 

Vhold 

Hold voltage. Below this voltage, the protection shuts down again. 

Vesd(I) 

Clamp voltage at a given discharge current I. This voltage depends on the 
width of the device and determines the ESD level that can be reached. 
Vtz^ItZ 

Fail voltage/current. Above this current the protection is destroyed. 

In contrast to the DMOS transistor breakdown, the ESD protection can conduct 
substantial current after snapping back without being destroyed. Also the 
difference between the trigger voltage Vn and hold voltage Vhoid is much smaller. 
Clearly, the ESD protection devices should be applied so that they do not 
hamper normal operation of the circuit that they are supposed to protect. Thus 
during normal operation, the voltage across the deviee should remain below the 
hold voltage Vhou^ This is especially important for protections that are eonnected 
between supply rails. In the A-BCD process, three protection devices are 
available with hold voltages of about 5V, 8V and 12V. For higher supply 
voltages these device can be stacked. For example, a 32V protection can be 
realized by making a stack of two 12V protections and one 8V protection. The 
parameters of the stack can be estimated by simply adding the parameters of the 
components. In this manner a whole range of ESD protection circuits can be 
tailored for many different situations. 

The ESD protections should be placed such that the discharge current of an ESD 
pulse applied between two arbitrary pins of an IC always flows through one or 
more protection devices and not through the circuit. An exception is made for 
very large components in the circuit such as the backgate diodes of the power 
transistors or the bootstrap diode. These devices are capable of handling large 
currents without being damaged. It is not practical to insert an ESD protection 
between each pair of pins of an IC. A 24-pin IC would require 552 protections. 
The goal is to use a minimal number of protections while maintaining a path 
through one or more ESD protections between any pair of pins. A complicating 
factor in power amplifiers is that usually separate power supplies are used for 
small signal analog, digital and power circuits. Further each channel often also 
has its own power supply. This can lead to intricate discharge current paths. 
Consider for example the circuit shown in figure 6. Here the class-D switching 
output stage shown earlier in figure 2 has been extended with ESD protections. 
Two antiseries 12V protections Zssp connect the relatively quiet digital ground 
VssD to the noisy power ground Vssp. The highside and lowside drivers are each 
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protected by a single 12V protection Zboot and Zaux- The switch control block has 
a separate positive digital supply Vddo and is protected by Zddo^ The protection 
Zesd across the output stage is a stack of five 12V protections yielding a 60V 
protection. Now suppose an ESD pulse is applied between the bootstrap pin Vtoot 
and the positive digital supply Vddo- The discharge path then goes through Ztoot, 
the backgate diode of Mh, Zesd, Zssp and Zddo yielding a total of seven reverse and 
three forward voltage drops which amount to at least 85V. 




Figure 6. ESD current path example 

As is illustrated by this example, the voltages that appear across the components 
in the circuit during a electro-static discharge can be quite different from the 
voltages during normal operation. In this case a careful inspection of the 
levelshifler circuit is required since this circuit is connected to both Vboot and 
VddD- Sometimes, special measures have to be taken to make a circuit more 
robust against ESD. This can be achieved by strategically adding diodes and 
resistors to the circuit. Consider the configuration shown in figure 7(a). Here 
transistor M; can be a current source connected to the negative supply rail Vssa 
and M 2 the input diode of a current mirror connected to the positive supply rail 
VddA- Both transistors have their source and backgate connected. A stack of three 
ESD protections Z; is connected between the supply rails. The diode Di serves 
to block the reverse current path through the backgate diodes of Mi and Mj. A 
discharge current flowing from Vssa to VddA would otherwise flow through the 
two backgate diodes in the circuit instead of the three diodes in the ESD stack. 
Sometimes it is not practical to add an additional ESD protection to protect a 
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single component in a circuit. In that case it is also possible to add series 
resistance in the circuit in order to limit the current when the component breaks 
down. 




Figure 7. BSD measures (a) blocking reverse current paths (b) limit breakdown 

currents 



For example, in the circuit shown in figure 7(b). the voltage that occurs between 
gate and toin of transistor M} may exceed the breakdown voltage of the device. 
As was explained earlier the diode Dj protects the gate-oxide from being 
damaged. Now if M? goes into breakdown the resistor Ri limits the current 
through the device, thus avoiding snapback. Clearly, for the output power 
transistors, adding series resistance is not a viable option. 

4.3. Fault Conditions 

Robustness against a number of fault conditions can be required. In most 
consumer applications the output terminals of audio power amplifiers are 
directly coimected to the loadspeaker connectors. Therefore audio power 
amplifiers should be robust against accidental short circuits that can be made 
between the output(s) and supply lines. Short circuits come in many forms. 
During robustness testing shorts are applied with short and long wires, variable 
resistors (strangle) and inductors between the outputs and supply lines with the 
output signal varying from mute to clipping. In one particularly nasty test a 
contact probe is dragged along a coarse file to emulate an intermittent short. A 
full short circuit protection usually consists of at least an overtemperature 
protection (OTP) and an overcurrent protection (OCP) [5,6]. Overcurrent 
detection can be done efficiently using scaled replica transistors biased at a 
small reference current as shown in figure 8. For the lowside the drain voltage of 
the output power transistor Mi is compared to the drain voltage of replica Mrejtow 
as shown in figure 8(a). The comparison is only valid if Mi is switched fully on 
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in which case Vga/e/ow equals Vaux^ When Ml is switched off the comparator is 
decoupled from the output node by switch Si. 




Figure 8. Overcurrent detection (a) lowside (b)highside 

At the highside the source voltages of the highside power transistor Mh and 
replica Mre/high are compared as shown in figure 8 (b). If Mh is switched on then 
Vgatehigh equals Vboot- Because Vboot is coupled to Vout, both comparator inputs are 
decoupled by switches 82,3 when Mh is switched off 




(a) (b) 

Figure 9 . Supply pumping (a) sourcing current from positive supply (b) 
charging decoupling capacitor negative supply 

A different fault condition can be caused by the class-D amplifier itself Most 
power supplies are only capable of sourcing current. In figure 9 this is illustrated 
by means of series diodes DssP and DddP in the supply rails. Consider the case 
where a current fut is flowing out of the amplifier towards the load. When the 
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highside power transistor Mh is switched on the current can be sourced by the 
positive power supply Vddp as shown in figure 9(a). However, when the lowside 
power transistor Ml is switched on, the current can not be sunk by the negative 
power supply Vs^p and has to flow throught the decoupling capacitor CssP as 
shown in figure 9(b). This causes the voltage across Cssp to increase, which 
unbalances the supply voltage. This mechanism is called supply pumping and 
can lead to destruction of the amplifier if the output current lout does not change 
direction. An overvoltage protection prevents the supply voltage to be pumped 
up to unsafe values by shutting down the amplifier. Also an unbalance 
protection can be desired which limits the amount of unbalance between the 
positive and negative supply voltages. Note that supply pumping does not occur 
in bridge-tied-load (BTL) configuration. 

5. Design of Class-D Output Stages 

As mentioned earlier, one of the dominant sources of distortion in class-D 
amplifiers is dead time. Therefore, minimal dead time is an important issue in 
class-D output stage design. To this end, a thorough understanding of the 
switching dynamics is essential. 



5.1. Switching Dynamics 

The switching dynamics of a class-D output stage can be analyzed with the 
simplified schematic shown in figure 10. The input signals inhigh and W/ow refer to 
the output voltage Vout and negative supply Vssp respectively. The switching 
dynamics at the output Vout depend on the size of the power transistors Mi and 
Mh, driver transistors Mph, Mnh, Mpi and M„/ and on the output current loup At 
first, the output current lout is assumed to be zero. Now suppose that the lowside 
power transistor Mi is conducting and so the output Vout is low. When switching 
the output Vout from lowside to highside the following sequence of events 
occurs. First, signal m/ow goes high and Mi is switched off by discharging the 
gate through lowside pull-down transistor M„/. The output Vout remains low but 
shows a small step caused by capacitive feedthrough. This situation remains 
unchanged during the dead time. After the dead time signal inhigh goes low and 
highside pull-up transistor Mph starts to charge the gate of the highside power 
transistor Mh. As soon as the highside gate-source voltage Vgsh exceeds the 
threshold voltage Vj, highside power transistor Mh starts conducting and the 
output Vout starts to rise rapidly. During the transient of the output voltage Vout 
the highside gate-source voltage Vgsh stalls because all available current Iph from 
highside pull-up transistor Mph is now used to discharge the highside gate-drain 
capacitance Cgdh- The highside gate-drain capacitance Cgdh appears much larger 
due to the well-known Miller-effect. The slewrate of the output voltage is thus 
determined by: 
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dV^_Jph_ 

^gdh 



( 1 ) 



Note that at the same time, this same slewrate appears across the lowside gate- 
drain capacitance Cg^i- Since both power transistors are identical, the current 
through Cgdi equals the current through Cgdh- This current /„/ flows through 
lowside pull-down transistor M„/ and causes the lowside gate-source voltage Vgs, 
to rise. The size of M„/ should be made such that this voltage remains below the 
threshold voltage Vjto avoid cross conduction. 




Figure 10. Simplified Class-D Powerstage 

In other words, the ratio of the on-resistance Rph and R„i of highside pull-up 
transistor Mp/, and lowside pull-down transistor M„i should be such that: 



^nl 

R^i + Rpf, 



< 




( 2 ) 



After the transition of the output has finished, the highside gate-source 
voltage Vgsh continues to rise until the gate of highside power transistor Mh is 
fully charged while the gate of lowside power transistor Mi is (again) fully 
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discharged. The gate-source voltages Fgs/ and Fgs/, and the output voltage Fo„, for 
this transition are shown in figure 1 1 . 




Figure 11. Foliage transients with lout=0 

If the output current lout is not equal to zero the situation changes. Consider the 
case where a small current Fm of about 100mA is flowing into the amplifier. The 
corresponding voltage transients are shown in figure 12. 




Figure 12. Foltage transients with Fut— 100mA 

As can be seen, the gate-source voltages and Fgsh are similar to the case 
where Fut is zero. The output voltage Fout however looks quite different. This 
time, as soon as the lowside power transistor Mi is switched off, the output 
current starts charging the output F,„,. The slewrate is determined by the output 
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current lout and the capacitance at the output node Vout^ After the dead time the 
highside power transistor Mh is switched on and slewrate is now again 
determined by equation (1). This results in a characteristic dual slope output 
voltage from which the dead time can easily be determined. 




Figure 13. Voltage transients with lom^lOA 

As the output current lout increases, so does the slewrate of the output voltage 
Vout during the dead time until is exceeds the slewrate determined by equation 
(1). Consider again the previous example but with a large current lout of about 
lOA flowing into the amplifier. As soon as the lowside power transistor Mi 
switches off the output current lout starts to charge the output node Vout, pulling it 
towards the highside. In this case the slew rate is limited by the following 
mechanism. Due to the increased slewrate at the output Vout the current that is 
forced through the gate-drain capacitance Cgdi is also larger than in the previous 
case. This current Ini flows through lowside pull-down transistor Mni. which 
results in a voltage drop that now prevents the lowside transistor from switching 
off. The lowside gate-source voltage V,s, stalls at a value above the threshold 
voltage Vt. In this case the slewrate of the output voltage is determined by: 

^^out _ ^nl 

dt C^dl 

At the same time the highside power transistor Mh is prevented from switching 
on because the highside pull-up transistor Mph is too small to deliver the 
necessary current. Only after the transition of the output Vom has finished the 
highside power transistor Mh can be switched on. The corresponding voltage 
transients are shown in figure 13. 
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If the direction of the output current lout is reversed the switching dynamics and 
voltage transients as shown in figure 11 remain largely unchanged. The only 
difference is that during the dead time, the output current lout flows through the 
backgate diode of lowside power transistor Ml. Now as soon as the highside 
power transistor Mu is switched on, the backgate diode of lowside power 
transistor Mi is reverse biased instantly. This causes a short but high reverse 
recovery current that flows between the supply lines. Reverse recovery current is 
one of the main sources of EMI in class-D. When switching the output Vout from 
highside to lowside, the same sequence of events occurs with the role of lowside 
and highside interchanged. Now the slewrate is determined by: 

^^OUt ^ ^ pi 

dt 



while the ratio between the on-resistance Rpi and Rnh of lowside pull-up 
transistor Mpi and highside pull-down transistor M^h should be such that: 



Rnh . Vt 

Rnh Rpl ^aux ~~ ^ssP 



( 5 ) 



In practice the drivers of the highside and lowside are made identical. This can 
be done because the highside and lowside driver have about the same supply 
voltage since the bootstrap capacitor Cboot is charged to Vaux when the output Vout 
is low. 




Figure 14. Highside driver undervoltage protection 

When the output Vout is high the highside driver is supplied by the bootstrap 
capacitor Cboot and is essentially floating which poses a potential problem for the 
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class-D output stage. If the voltage Fboot decreases too much the pull-down 
transistor is not low-ohmic enough to keep the highside power transistor Mh 
switched off during a falling edge of the output Vout- In this case both power 
transistors would be conducting simultaneously which could destroy the class-D 
output stage. To avoid this, the highside driver is equiped with an undervoltage 
protection as shown in figure 14. Resistor Ri pulls down the gate of Mi until the 
voltage VboorVout becomes higher than three threshold voltages which is enough 
for proper operation. A hysteresis window around the undervoltage level is 
realized by transistor M 4 and resistor R 2 , 

5.2. Zero Dead Time 

In figures II, 12 and 13 can be seen that the name dead time has indeed been 
chosen appropriately. During the dead time nothing happens. In case the output 
current is small or zero the dead time is merely a waiting state. In case the 
output current is large all events take place immediately after the lowside power 
transistor Ml is switched off, regardless of the dead time. Consequently, the 
dead time can be made shorter without influencing the switching dynamics as 
described before. Consider what happens in the previous examples if the dead 
time would be reduced to zero [7]. In the starting situation lowside power 
transistor Ml is switched on and highside power transistor Mh is switched off. 
Now signal iniow goes high and simultaneously signal inhigh goes low. 
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Figure 15. Zero dead time voltage transients 

Consequently, the gate of lowside power transistor Ml is discharged through 
lowside pull-down transistor M^i while at the same time the gate of highside 
power transistor Mh is charged through highside pull-up transistor Mp^. Because 
of the ratio of the driver transistors the discharge of the gate of lowside power 
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transistor Ml goes faster than the charge of the gate of highside power transistor 
Mh, the gate-source voltages of Ml and Mh reach the threshold level Vj at about 
the same time. At this time the output voltage Vout starts to increase according to 
equation (1) and the behaviour is further identical to that described previously 
The gate-source voltages Vgsi and Vgsh and the output voltage Vout for this zero 
dead time case are shown in figure 15. As can be seen in figure 15, the gate- 
source voltage transients are more or less identical to those in figure 1 1 except 
for the absence of dead time. Apparently, dead time is not necessary for correct 
operation. When the driver transistors have the correct ratio the transitions at the 
output are self-regulating. 




Figure 16. Zero dead time voltage transients with lout=J0A 

This is also true in case the output current lout is not zero. For small output 
currents the voltage transients remain similar to those shown in figure 15. The 
voltage transients for a large output current Fut of about lOA are shown in figure 
16. As can be seen the voltage transients are virtually identical to those in figure 
13. Consequently, it can be concluded that a class-D output stage with zero dead 
time is possible provided that the driver transistors are properly dimensioned 
and the input signals of the drivers are each others inverse. Obviously this latter 
is the most difficult part. What is needed is a means to set and reset the latches 
in the highside and lowside drivers simultaneously, i.e. the set and reset signals 
should have equal delays. For this purpose a fast levelshifter is required. 

5.3. Levelshifters 

A fast and robust levelshifter is necessary to transfer the set and reset signals to 
the latch in the highside driver. The latch is implemented by two inverters 
connected in a loop as shown in figure 17(a). The latch can be set and reset by 
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PMOS pull-up transistors M r. Note that for proper operation, the signals 5 * and r 
are not allowed to overlap by being pulled down simultaneously. The signal 
delay in the levelshifter should not depend on the voltage difference that needs 
to be shifted since the highside driver can be at both highside and lowside 
voltage level. A simple but current-hungry implementation is shown in figure 
17(b). The signals Sa and are meant to be the levelshifted inverse of the set and 
reset signals and are used as inputs for the driver latch. The pull-down 
transistors Mj ^2 have to be dimensioned such that their drain currents cause the 
appropriate voltage drop across the pull-up resistors Rj 2 - For low current 
consumption the resistors need to be high-ohmic which conflicts with the speed 
requirement. 




Figure 1 7. Levelshifting (a) latch (b) simple levelshifter (c) current efficient 

levelshifter 

A more current efficient solution is shown in figure 17(c). The latch formed by 
transistors can be toggled by pull-down transistors Mgj. Transistors 
serve to limit the voltage on the nodes sg and r^. Suppose the reset signal is high 
and the set signal is low. In this case node is pulled down through to a 
level that is one threshold voltage higher than Vout while node r^ is pulled down 
towards V^sd- On the other side node is pulled up towards Vboot by Mj. and Sb is 
pulled up to Sc through M3. In this situation no current flows. As reset goes low 
the voltages remain the same but now nodes rb and have become floating. 

Now as set goes high nodes Sb and Sc are pulled down instantly. However, since 
My is still switched on, node is not pulled down completely but stalls at a level 
determined by the dimensions of M3 and My. In this situation a substantial 
current flows through the branch M3, M3, My. On the other side Mg is switched 
on partially and starts pulling up node and rb. If the dimensions of M^^g and 
M 5 7 are the same, node stalls at the same level as Sc. This situation continues 
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until node rt is pulled up to the level of Then and are pulled up to Vbooh 
My shuts down and Sc is pulled down to one threshold above Vout- These voltage 
transients are shown in figure 18. 




Figure 18. Voltage transients in level shifter 

The signals and rc are meant to be the inverse of the set and reset signals. 
However, as can be seen in figure 18 the signals and are distorted and 
overlap for a significant time which translates to a delay at the latch output Q. 
This delay depends on the voltage level that needs to be shifted and occurs 
because My and Ms are conducting at the same time, counteracting each other. 




Figure 19. Fourstroke levelshifter 
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Note that after the set signal goes high the node Sc is pulled down instantly while 
after the reset signal goes high the node Vc is pulled down instantly. This 
property is exploited by the circuit shown in figure 19. Two signals, precharge 
and discharge, are inserted between the set and reset signals. The four signals 
are subsequently high one at a time in a fixed cyclic sequence. Suppose the reset 
signal is high and the others are low. In that case nodes ri, and rc are at V^sd and 
one threshold above Vou, respectively and all other nodes are at Vbooi- In this case 
only nodes db and dc are pulled up actively by Mj 3 while the others are floating. 
Now as reset goes low, precharge goes high, nodes pb and pc are pulled down 
unhindered since M 14 is off. Consequently, Mg is switched on fully and nodes r* 
and rc are pulled up fast. Next precharge goes low and set goes high and the 
events are repeated with the corresponding components and nodes. In this 
manner it is avoided that pull-up and pull-down transistors counteract each 
other. This arrangement is called the fourstroke levelshifter [ 8 ]. The resulting 
transfer delay is extremely small and almost independent of the voltage level 
that needs to be shifted. 




Figure 20. Voltage transients in fourstroke 

The corresponding voltage transients are shown in figure 20. As can be seen as 
soon as the set signal goes high, the node Sc is pulled down instantly without 
stalling. The same holds for reset and node r^. Not shown in figure 20 are the 
precharge and discharge signals which are high in between of the set and reset 
signals. The precharge signal pulls up node rb before the set signal goes high. 
Consequently, the stalling problem explained earlier is avoided. The discharge 
signal does the same with node Sb- In this maimer the delay between the rising 
edge of the set {reset) signal and the falling edge of the 5c(^c) signal has become 
very small and almost independent of the voltage level that is shifted. For 
robustness, the diodes Dis have been added to the circuit shown in figure 19. 
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Diodes D 1.4 serve to protect the gateoxide of transistors when their 

sources are floating. Diodes D 5 .S block the reverse current through the backgate 
diodes during an BSD discharge from Vssd to Vboot- 




Figure 21. Delay matching highside and lowside 

The fourstroke levelshifter is used for both highside and lowside drivers in order 
to match the signal delays of the set and reset signals as shown in figure 21 . 




Figure 22. State Machine 

The four signals needed for the fourstroke levelshift can readily be generated by 
a simple state machine as shown in figure 22. The state machine runs through a 
cyclic pattern of states thus generating the appropriate driving signals for the 
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fourstroke levelshifters. The highside and lowside are connected such that the 
set signal for the highside is used as reset for the lowside and vice versa. 

Measurements 

The zero dead time arrangement presented previously has been implemented in 
the output stage of an integrated class-D amplifier. In figure 23 the rising edge 
of the output signal is shown. For comparison, also the rising edge of a 
class-D output stage with dead time is shown. As can be seen in figure 23, this 
dead time is approximately 75ns which can readily be determined by the 
characteristic dual slope. The zero dead time version does not show a dual slope 
which means there is no measurable dead time anymore. For this measurement a 
supply voltage of 60V was used. 




Figure 23. Rising edge with and without dead time 

Figure 24 shows the THD+N of a full-bridge zero dead time class-D output 
stage in an open loop configuration. A supply voltage of 30V was used with a 
6Q BTL load. The measurement was done with a IkHz audio signal but 
distortion does not appear to depend on signal frequency. The PWM input signal 
is generated by a digital PWM modulator and has a carrier frequency of 
352.8Khz (8fs). For comparison the THD+N of an almost identical class-D 
amplifier with 75ns dead time is also shown. As can be seen the difference in 
performance is large. The zero dead time distortion stays comfortably below 
0.1% up to the point of clipping making this class-D output stage suitable for 
open loop class-D audio amplifiers. 
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Figure 24. THD+N zero dead time vs. 75ns dead time 

10. Conclusions 

Design of class-D output stages in an SOI-based BCD process has been 
presented. Special attention was given to the various robustness aspects involved 
in the design. A study of the switching dynamics in the class-D output stage 
under various load conditions showed that dead time can be eliminated provided 
that the timing difference of the signals that drive the power transistors can be 
made small enough. A fast levelshifter topology was presented with a delay that 
is independent on the voltage difference that needs to be traversed. 
Measurements show a significant improvement in performance. 
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Abstract 

Nowadays, there is an increasing demand for portable wireless ap- 
plications at a low cost. In particular, CMOS allows the integration 
of mainly digital circuits with the analog transceiver. For reasons 
of cost, reliability, power and size reduction it is desirable to have 
all analog blocks from baseband to RF integrated on the same sub- 
strate. Furthermore, due to the rapid technological change with 
new processes having power supply reduction, it becomes increas- 
ingly difficult to design optimal analog circuit blocks. This is es- 
pecially true in the case of circuits that have to deliver power to the 
output and as such do not gain from a voltage reduction. In mobile 
communications this block is the power amplifier. 

This paper discusses solutions that allow circuits to work with sup- 
ply voltages above the maximum stated by the foundries guide- 
lines. The advantages and disadvantages presented are focused on 
the desired to increase the class E power amplifier efficiency. Tech- 
nological reliability aspects are also discussed. Finally, the paper 
will stress that more complex CMOS technologies are required in 
order to gain from a high supply voltage. 

1. Introduction 

Considerable effort has been put in research to integrate in CMOS full transceivers 
for mobile communications [1]. The final objective is to include a system-on-chip 
with digital processing, analog functions and support blocks. On the analog side, 
the inclusion of the power amplifier is still a rare situation. The need to have high 
efficiency and the required current flowing in it makes that dedicated chips are 
often used. Some of the solutions can include: GaAs [2] and bipolar [3]. Although 
they provide the required output power with high efficiencies, it is nevertheless 
desirable to replace them by solutions using CMOS [4-6]. The advantages come 
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mainly of the current trend to integrate all circuits in the same die for cost but also 
area reduction. 

With more advanced CMOS technologies having a lower power supply it be- 
comes extremely difficult to achieve the high output power required by the com- 
munication standards. This is the reason why there is a lack of offer for the 3 W 
output power level range with integrated power amplifiers in CMOS. Especially 
true for the cases where the previous block has an output power in the range of 
-10 to 0 dBm [1]. As a result, a gain of more than 30 dB is required. In addition 
to this, higher currents are now required to achieve the same output power. The 
result is that wider interconnections are necessary to carry the current and avoid 
electromigration reliability issues. Moreover, with lower voltages -because of the 
higher current flowing, every small resistance has an increased importance. 

Consequently, from the above it seems advantageous to explore high-voltage 
in standard CMOS with the purpose of having a smaller transistor size, increased 
load impedance and lower current in the circuit. Furthermore, this high-voltage is 
already present in the battery. The objective is to have a more efficient amplifier. 

This paper starts by first reviewing in Section 2. the source of losses in the class 
E amplifier. The design and optimization methodology of RE power amplifiers are 
presented in Section 3.. In Section 4., solutions to work at a high supply voltage 
along with a discussion of device reliability are described. Following this, a design 
example is presented in Sections.. Finally, the conclusions are given. 

2. Brief Review of the Amplifier’s Losses 

The basic class E amplifier in Fig. 1 can ideally have an efficiency of 100 %. 
Nevertheless, diverse non-idealities present in the components will cause losses 
decreasing the maximum attainable efficiency. They range from inductor and ca- 
pacitor losses as well the transistor (switch S). In a more realistic implementation, 
this would also include the losses associated to the wiring and taking the signal in 
an out of the chip through the pads. 

2.1. Transistor Losses 

For the transistor, a simple switch S can represent its off- and on-resistance, al- 
though a more complete model can also include losses due to nonzero switching 
time and lead inductance [7]. 

In class E amplifier, due to the switch on-resistance (Ron) power is dissipated. 
The efficiency can be approximately given by [7] 




179 




Figure 1: Ideal class E power amplifier. It consists of switch S, the finite dc-feed 
inductance L\, the shunt capacitor C\ the series tuned (Lq-Cq) in series with a 
reactance jX and the load resistance Rl- 



Rl 

Rj_. “1” 1.365-/?c> 



where 



( 1 ) 



Rl = 



8 V, 



DD 



( 2 ) 



7T^ + 4 Pout 

In the above equation and for a constant output power we have that 

Rl oc vId (3) 

In this case it is interesting to study in which conditions it is possible to increase 
the supply voltage to increase the load resistance and thus the efficiency. 



2.2. Other Losses 

The other important source is the inductor. Bonding wires are still commonly used 
to get the best performance due to their higher quality factor. It has been shown 
that the machine-bonded bonding wires have less than 5 % inductance variation 
and less than 6 % quality factor variation [8] making them suitable for integration. 
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For cost reduction during manufacture, integrating them on silicon is a viable 
solution. However, their low quality factor greatly diminishes the efficiency. The 
inclusion of a resistor in series with the inductor is a simple way of representing its 
losses. More accurate models can be found in [9, 10] for on-chip spiral inductors. 

Capacitors usually have a quality factor high enough in order not to be the 
limiting source of losses. Although this may be true, it is necessary to minimize 
the metal series resistance. Parasitics to substrate can be minimized by either 
using a high-resistivity substrate or by placing a ground shield underneath. The 
latter technique can also used to prevent the substrate resistance from injecting 
noise currents. 



3. Design Methodology 

A close inspection to the circuit of Fig. 1 reveals seven degrees of freedom. And 
although analog designers have clearly derived accurate equations for what ap- 
pear to be more complex circuits, addressing the referred circuit has been a more 
difficult task. Different research has shown that apparently it cannot be explicitly 
solved. Equations do exist but not addressing all trade-offs. 

Equally important is that even for circuits with the complexity of the one given 
by Fig. 1, existing equations [7, 1 1] do not guarantee that the maximum efficiency 
is indeed reached. Or that some of the initial assumptions made prior to equation 
derivation might not lead to the most power efficient design [12]. Furthermore, 
with inclusion of simple parasitics it becomes increasingly more difficult to gen- 
erate equations that are accurate enough. 

Two possibilities will be given that partially address the desires of the analog 
designer: 

# State-space model description 

• Automated sizing using a simulation-based optimization 

The state-space model allows to address the desire that is to have a set of par- 
tially derived equations. Nevertheless, at the end an optimization is necessary to 
find values that originate a steady-state operation. 

On the other hand, initial conditions are not necessary when using a simulation- 
based optimization. Furthermore, any circuit can be optimized being limited by 
the capacity and accuracy of the simulator. The penalty clearly resides in the 
longer CPU time. The continuous increase in computing power alleviates this 
drawback partially. With a simulation-based optimization, a simulation tool is 
used to evaluate the parameter vectors proposed by the optimizer. As such we 
simplify the inclusion of the parasitics. Also, as no preconceptions and restrictions 




181 



are made prior to circuit sizing, the possibility exist that new and more interesting 
solutions can be found. 

3.1. State-Space Model of the Power Amplifier 

The operation of the power amplifier of Fig. 3 can be seen as linear time-invariant 
circuit ^ containing a periodically operated switch Rk with two states in the switch- 
ing period T . In one of the states (ri) it has a small value representing the low 
on-resistance value. In the other one (t 2 ), a high value represents the transistor in 
the off state (Fig. 2). Defining the switching instants = nT, an , 2 = nT + ri, 
and 3 = nT + ri + T 2 = (n + 1)T, a set of state equations of the form 

^n,k — ^k^n,k(t) “I" ^k^iO (4) 

yn,k — ^k^n,k(.t^ "F T)]cU{t) (5) 

^n,k < t < an,k-\-l A: = 1, 2 

can be used to represent the linear circuit. The previous quantities Xn,kit), 
yn,k{t) and u{t) are, respectively, the state, output and input vectors, and Ak, 

Ck, Dk are constant real matrices. 




""n,1 

nT 



(n+1)T 



Figure 2: Notation for the nth switching period. 



The standard continuous system in the form of 
dxn k(t) 

- : = AkXn,k(t) + BkU(t) ( 6 ) 

dt 

in each portion of the «th switching period Xk, k=\,2, has a solution in the form 
of 

Xn,k{t) = f Bku{r})dr} (7) 

Jcm,k 

*The notation of the work of Liou [13] will be followed. 
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since 



f - I) ( 8 ) 

Jcfn,k 

(/ is the identity matrix). The general solution of (6) to an input signal u(t) = u 
at switching instant is given by 

Xn,k(t) = - l)BkU (9) 

Knowing that the values of Xn,k at the end of the on-state are the initial values 
of the off-state, the complete system can be simulated. However, it is of interest 
to find the steady-state for a complete evaluation of the amplifier performances. 
The transient mode can be completely eliminated by setting the initial state vec- 
tor xo,i(0) equal to Ju. Since the state doesn’t changes discontinuously at the 
switching instants, the complex matrix J can be simply written as 

J = {I-M)-^H (10) 

In the former expression, M is a real matrix and equal to: 

H is also a complex matrix expressed as: 

H = - I)Bi + - I)B2 (12) 

Analyzing the circuit from Fig. 3, the Bk, Xn,k and m(^) matrices are respec- 
tively 
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Figure 3: Ideal class E power amplifier as in Fig. 1 with the indication of the state- 
space variables. The switch S is now more conveniently represented by resistance 
Rh 



= lift) 
X2 = Vcft) 
^3 = Vioit) 
X4 = Vco(0 



(13) 



in this case, the state variables are the inductor L\ current, the capacitor C\ 
voltage, the inductor Lq current and the voltage across Cq. During the off state 
the switch exhibits a high resistance Roff^ while during the on state the value of 
Rk is reduced to Ron- 

The remaining matrices, yn,k, Ck and Dk are respectively 



yn,k = [Vo] Ck = [0 0 0 Rl] Dk = [0] 



The above algorithm was implemented in Matlab® and used to optimize the 
circuit from Fig. 1. The initial values for the components can be obtained using the 
formulas presented in [1 1]. Now, we are able to optimize the circuit for minimum 
losses. It is also possible to see the influence of each component separately by 
varying each one of them and observing how the losses change. 
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Moreover, using the above equations, the influence of the series resistance in 
the inductors (Li and Lq) can be studied. The usefulness of this method, however, 
ends here if we also want to 

• Include a more complex model for the inductor [9, 10] 

• Model Cl as the Cgd of a MOS transistor (non-linear) 

• Fully optimize a multistage amplifier 

3.2. Automated Sizing Procedure 

The implemented sizing methodology is a simulation-based optimization approach 
using a differential-evolution optimization algorithm [14]. Any circuit variables 
(device sizes, values, inputs, . . . ) can be selected as optimization parameters. 
Furthermore, one or more optimization objectives (minimize, maximize) can be 
specified as well as a number of (performance) constraints. More information on 
the optimization procedure can be found in [15]. 

In order to facilitate the automated optimization of specific circuit classes con- 
straint and objective templates can be loaded. These could have been stored for re- 
use by the designer himself or provided by another expert designer. Furthermore, 
in order to realistically take parasitics of active or passive devices into account, 
customized device models can be input to the tool. The effective values of the de- 
vices in the model are calculated by the optimizer depending on the optimization 
parameters and by interaction with the device profiler. 

The methodology has been implemented in a software tool called Pamper ^ 
and encompasses a simulation-based circuit optimizer (Dance, which is part of 
the developed software environment presented here) and device profilers that ac- 
curately extract device layout parasitics that are then used in user-defined device 
models within the DANCE tool. 

Some of the features of the proposed methodology and tool implementation 
include: 

• Simulator independence (currently configured to work with ELDO, HSPICE 
and Matlab® ) 

• Knowledge in the form of objectives and constraints is stored in a template 
which can later be reused 

• Replacement of any device by user-defined device models 

^PAMPER stands for Power AMPlifiER. 
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• Information on process variation of the inductors can be taken into account 
during the optimization 

• Integrated support for taking the maximum current that flows in the inductors 
into account during optimization (needed for meeting the electromigration 
constraint) 

Experimental results derived with the software tool, including all layout para- 
sitics, can be seen in [16]. 

4. High-Voltage Solutions in CMOS 

A variety of solutions already exist that allow the designer to increase the supply 
voltage without causing circuit malfunction or device destruction. They can be 
divided in two categories: 

• Circuit breakthroughs where the goal is to correctly define the operating point 
such as the voltage of one transistor terminal to another terminal does not 
reach a critical value. The objective in here is to equally stress the transistor 
without impairing its reliability. 

• Customized silicon technologies. Although more expensive to manufacturer 
is it possible to have breakdown voltages high enough to be useful for the ap- 
plications in question. And at least one solution [17] can be seamlessly inte- 
grated in standard CMOS technologies having local oxidation of silicon (LO- 
COS) or shallow trench isolation (STI) as their preferred isolation method. 

4.1. Circuit Breakthroughs 



Cascode 

The easiest possibility is to simply use the cascode stage (Fig. 4(a)). An example 
of this applied to a class E RF power amplifier can be found in [5]. The main 
disadvantage is that now the voltage swing on the gate-drain of the cascode tran- 
sistor is larger than the common-source transistor. To make them equal, dynamic 
biasing of the drain voltage of the cascode transistor is necessary. 

Self-Biased Cascode 

As proposed by Sowlati [6], the biasing is now implemented using Rb — Cb with 
the increased advantage of not requiring any extra bond pad (Fig. 4(b)). 

For class E, the positive voltage swing around the supply voltage is larger the 
negative swing. Is this case the circuit from Fig. 4(c) is more convenient. In this 
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case Ml and M2 have the same voltage swing at the gate-drain. Thus, hot carriers 
effect is relaxed. As a result, the amplifier works under maximum output power 
without showing performance degradation. 




S1 S1 S1 



(a) (b) (c) 



Figure 4: Various cascade configurations: (a) Conventional, (b) self -biased and 
(c) bootstrapped. 



Non Zero Drain Voltage at T\irn-On 

The last example does not require any change in the circuit [12]. Instead, it prop- 
erly selects another condition for operation. The amplifier electrical diagram is 
thus equal the ideal class E amplifier in Fig. 1. 

Differently from [11] that requires the voltage across the transistor drain has to 
be zero and the slope of the drain voltage should be zero as well to achieve high 
efficiency (5(a)), the amplifier is now designed such that the switch transition from 
the off to on state has a substantial voltage step (Fig. 5(b)). 

If sized as described, for the same output power it is possible to decrease the 
maximum switch voltage. Another advantage is that it has the effect of lowering 
the average current through the switch thus lowering conduction losses for the 
same conduction angle. Experimental results have show that the introduction of 
the voltage step improved efficiency. This effect was also seen with the software 
tool described in Section 3.2.. 
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(a) (b) 

Figure 5: Voltage and current waveforms at the switch, (a) As commonly designed 
[7yll] and (b) with non zero voltage at turn-on [ 12 ]. 



4.2. Customized Silicon Technologies 
Lightly Doped Drain: LDD 

Common on current CMOS technologies, LDD [18] is designed to lower the max- 
imum electric field by having part of the depletion region inside de drain. This is 
achieved by reducing the doping gradient at the gate edge. The gate is surrounded 
by the spacer used to define the LDD region. 



□ 




Figure 6: Device with lightly doped drain implant (n) surrounded by a high-doped 
region (n+) to implement the source and drain region. 



Unfortunately a lightly doped drain is also a lightly doped source. This extra 
resistance increases the channel resistance and also degrades the maximum fre- 
quency that a circuit can operate. The latter is especially important for circuits 
designed for high-speed. 
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Thick Oxide 

The usage of a ticker oxide can be a method to increase the supply voltage without 
causing its disruption. The new device can be seen as a device from a previous 
generation implemented in an advanced technology. By having double oxide tran- 
sistors, the I/O can circuits can use the higher voltage to connect to the outside 
world while high-speed and low power can be obtained with the use of a lower 
supply voltage. 

Reduced Surface Field: RESURF 

Invented by Appels and Vaes [19] in 1979, can be applied in different layers to 
equally distribute the applied voltage laterally across the silicon surface in the drift 
region of the device. The purpose is to move the breakdown from the surface to the 
substrate by depleting the layer where the breakdown was to occur. Symmetrical 
electric field distribution at surface when the doping {Nepi) and the thickness (depi) 
follow: Nepi.depi = 10^^ at/cm^. 

Fully Customized Technology: EZ-HV™ 

Taking the previous approach to its limit, extremely high voltages are possible [20]. 
The possibility to fully control the number of layers, its doping and its thickness 
can drive the specifications beyond what is commonly expected. 

It is a clever way to solve the problem of hot-electrons. With a thick layer of 
silicon, electrons can be accelerated to a point where they have enough energy to 
cause avalanche breakdown. A better way is to simply create a device where elec- 
trons cannot move far enough to acquire the necessary energy to generate current 
by impact ionization [20]. In addition, latch-up immunity is a consequence of the 
oxide isolation used between devices on the chip. Breakdown values exceeding 
600 V with an on-resistance of 7.6 Q/mm^ were measured. 

Lateral Double-Diffused MOSFET 

Perhaps the most common way to increase the breakdown voltage in current tech- 
nologies, is to implement an LDMOS transistor [21,22]. The polysilicon overlaps 
the drift region, acting as a field plate to reduce the electric filed near the gate 
edge [21, 23], thereby increasing the breakdown voltage (Fig. 7). If changing the 
process flow is an option, or the use of more complex and more expensive tech- 
nologies is an options a few other possibilities exist [24-26]. 

Nevertheless, in current technologies the preferred method for device isolation 
is to use a silicon trench around the transistor, the effect of which is that it is no 
longer possible to form a channel under the gate. Consequently, CMOS compati- 
ble high-voltage transistors using extended drain and designed for local oxidation 
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of silicon (LOCOS) technologies [21,22] cannot be integrated with shallow trench 
isolation (STI). 

STI means that the new transistor has to be completely planar. Furthermore, 
the lack of thick oxide means that the field plate technique [21,23] cannot be used 
to reduce the electric field intensity near the gate edge. A cross section of the 
proposed LDMOS [17] structure is shown in Fig. 8. The entire region between 
the drain and source terminal has to be defined as an active area, otherwise an 
oxide trench is created within the drift region. This is in contrast to the procedure 
in [21]. The n“^ implant on the right side of the self-aligned gate is a result that 
is necessary to dope the gate in order to reduce its resistance. The drain is taken 
apart to decrease the electric field within the thin oxide connecting to the gate 
terminal. Finally, the poly over the n-Well is used to create a low doping drift 
region by blocking the n"^ implant over all the n-Well. 

The device uses a few different methods to achieve a higher working voltage. 
To decrease the electric field near the gate edge on the high doping drain side, a 
low doping layer was used. In addition this layer also serves the purpose of de- 
creasing the speed with which the electrons reach the drain lattice. One advantage 
is that the drift region is not contaminated by the thick oxide growth [27]. Also, 
the drift region can now be made smaller. Both characteristics lead to a lower 
on-resistance than in [21]. 




Figure 7: Cross section of the high-voltage transistor with the poly acting as field 
plate [21]. 



4.3. Reliability Concerns 

To maintain performance improvement, some reliability challenges must be ad- 
dressed [18]. The most common failure mechanisms are not of concern to the 
designer if foundry design rules are followed. However, it is of interest to know 
how some of these mechanisms influence and indeed limit the designer freedom. 
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Figure 8: Cross sectional representation of the STI/LOCOS compatible high- 
voltage structure in standard CMOS. 



Time Dependent Dielectric Breakdown: TDDB 

It is the time needed to break an oxide stressed with a high electric field. The 
oxide is stressed as a result of the high-voltage between the gate and the channel 
underneath. Some of its characteristics are: (a) it has a strong dependence of tem- 
perature (exponentially), (b) strongly depends on the area (linearly), (c) depends 
on the duty-cycle, (d) depends on the applied voltage, and (e) is a statistic process. 

Hot Carrier Injection: HCI 

As a result of the high-electric field between the source and drain, the carriers 
are accelerated until the drain becoming ballistic. When reaching the drain, they 
periodically destroy the drain high doped region. The inclusion of the LDD par- 
tially solves this issue. Furthermore, if the gate terminal has a high voltage value, 
the carriers can be deflected and literally thrown towards the thin oxide. In this 
case three things can happen: (a) if they have enough energy, oxide penetrations 
can happen. As a consequence slight variations in Vjh can occur. Moreover, 
this can happen slowly throughout days, weeks or even months depending on the 
energy; (b) if they have sufficiently high energy, the oxide can be immediately 
and permanently destroyed, or (c) they can be deflected to the substrate and cause 
considerable substrate currents. 
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Electromigration 

With integrated circuits becoming progressively more complex, requiring more 
current and at the same time being manufactured in technologies with smaller 
features, originates that metal electromigration is becoming an increasing form of 
circuit malfunction. 

Electromigration is an effect caused by a large number of electrons colliding 
with metal ions causing them to gradually drift with the electric current. Thermal 
energy produces scattering by causing atoms to vibrate. This is the source of 
resistance of metals. Semiconductors do not suffer from electromigration unless 
they are so heavily doped that they exhibit metallic conduction. 

The increase of the temperature or an increase in local current density can de- 
crease the useful lifetime of a circuit. And it has a strong dependence with temper- 
ature (exponentially). Some of its causes can include: (a) an increased intercon- 
nection resistance, (b) a short-circuit with neighborhood metal or (c) originating 
an open-circuit. 

Junction Breakdown 

Junction breakdown is a non destructive effect as long as no large current flows 
in the junction and overheating it. By adhering to foundry design rules this phe- 
nomenon can be disregarded. 

4.4. Short Channel Effect 

In general short channel effects are not destructive but mainly alter transistor char- 
acteristics over time. A good source of information is the book by Tsividis [18]. 
Common technologies suffer mainly from: 

• Drain Induced Barrier Lowering (DIBL) 

• Punchtrough 

• Subthreshold Current 

• Hot Carriers 

Drain Induced Barrier Lowering 

Drain induced barrier lowering is the effect of the drain voltage on the channel. 
The consequence is that channel barrier is reduced when the drain voltage in- 
creases. This dependence on the drain voltage is one of the problems in CMOS 
scaling. There are implications in the carriers’ velocity and also in the efficiency 
of carriers injection from the source to the channel. 
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Punchtrough 

Punchtrough occurs in CMOS transistors when at high drain-source voltage the 
drain space charge region expands over the entire channel width and comes in 
contact with the source depletion region. This can occur either at surface (surface 
punchtrough) or within the bulk (bulk punchtrough). Is this situation a large cur- 
rent can flow from causing device failure. In short channel devices the maximum 
allowed drain voltage is usually determined by punchtrough. 

Subthreshold Current 

A value of 100 mV/dec is a good value for the subthreshold slope. The minimum 
value of 60 mV/dec is usually accepted and CMOS technologies usually have a 
value around 80 mV/dec. The subthreshold slope is a measure of how good a 
device turns off. 

Hot Carriers 

This item is equivalent to the previous HCI. 

5. Design Example. 

5.1. Optimization of the Ideal Class E PA 

Now that we have shown the design strategy for the class E amplifier and also 
how high-voltage operation is possible in standard CMOS, we will proceed with 
the performance evaluation of the circuit in Fig. 1. 

For the specific case of a class E power amplifier (Fig. 1), it is commonly as- 
sumed that having a higher switch on-resistance necessarily implies a less efficient 
amplifier [7]. This is true in the case the shunt capacitor (C\ in Fig. 1) is not the 
parasitic drain-source capacitance and thus can have its value selected indepen- 
dently of the transistor width. To evaluate this effect, a simulation of a typical 
amplifier (Fig. 1) was performed using a differential-evolution optimization algo- 
rithm (described in detail in Section 3.2.. 

In order to show the relation between the inductor quality factor (Ql) and the 
supply voltage, the circuit from Fig.l was simulated with inductors type bonding 
wire (2 l= 40) and spiral inductor on silicon (2 l= 7). The typical transistor (from 
now on referred to as LV-MOS, low- voltage MOS) has a reference on-resistance 
equal to 1. The lateral double-diffused MOS (Fig. 8, [17]) has a measured on- 
resistance 2.3 times greater. The breakdown voltage is 7.5 V for the LV-MOS tran- 
sistor and 14 V for the LDMOS. Because the devices do not have simultaneously 
current and voltage (class E operation) the long term stress due to hot-electrons 
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is minimized. This is the reason the LV-MOS can work outside the manufacturer 
safety region. 




Supply Voltage (V) 



Figure 9: Drain Efficiency as a ffinction of supply voltage for different inductor 
quality factor Ql- Transistor on-resistance for the LDMOS (o) is 2.3 times greater 
than for the LV-MOS j. 



DE = 



Pout 

PSUPPLY 



(14) 



PAE = ~ (15) 

PsUPPLY 

The simulations of the circuit from Fig. 1 were performed using a differential- 
evolution optimization algorithm described in Section 3.2.. All elements were 
selected as optimization parameters. The optimizer first tries to find a feasible so- 
lution (satisfying the class E operation [11]) and then further tunes the parameters 
to optimize for the highest drain efficiency (DE) possible. The simulation-based 
optimization uses SPICE as a proven tool to simulate electric circuits. The main 
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Figure 10: Power Added Efficiency as a function of supply voltage for different 
inductor quality factor Qj^. Transistor on-resistance for the LDMOS (o) is 2.3 
times greater than for the LV-MOS (+ 



penalty resides in the CPU time necessary for each simulation. The continuous 
increase in computing power alleviates this drawback partially. The big advantage 
is the flexibility and accuracy of SPICE. 

The results of these simulations for the case of 

• f=850 MHz 

• inductor quality factor Ql equal to oo, 40 and 7 

• transistor on-resistance for the LDMOS is 2.3 times greater than for the LV- 
MOS 

• capacitor C\ is proportional to the transistor width and equal on both cases: 
LV-MOS and LDMOS 



• 1 W output power 
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are given in Fig. 9 and Fig. 10. From what can be seen, the drain efficiency 
is almost equal and independent of the transistor on-resistance being much more 
dependent on the inductor quality factor. Equally important, is the fact that the DE 
is also independent of the supply voltage. To note is in the case of lower supply 
voltages (imposed by technologies with smaller geometries) it is not possible to 
achieve the desired output power. 

A higher supply voltage does not allow to get a higher DE (Fig. 9). This can be 
justified attending to the fact that [11] 



Rl^ 



8 



''DD 



-\-4 Pout 



Cl = 



Pout 

ttmVId 



(16) 



(17) 



have opposite relationship to the power supply. As the transistor capacitance 
decreases, the load resistance increases with an increasing supply voltage. The 
equilibrium is to balance the losses in a smaller transistor (higher on-resistance) 
with a higher load resistance. 

In the case of the power added efficiency (PAE, Fig. 10), going to higher supply 
voltages, even with higher transistor on-resistance has advantages. This is due to 
the fact that transistor will be smaller (17) and thus, requires a smaller driving 
power. In order not to change the results obtained previously for the DE, the 
average power necessary to drive a capacitive load {Cgg) can be approximated by 

PiN=oi.Cgg.Vlj,.f (18) 

The above a represent a proportionality factor representing an inverter chain 
driving the transistor total input gate capacitance Cgg. 



5.2. Class E PA with an LDMOS transistor 

The device in Fig. 8 was designed with class E power amplifier in mind. Without 
changes in the process flow or any additional mask it is possible to duplicate the 
breakdown voltage of the 0.35 fjbm CMOS technology [17]. Hence, a supply 
voltage of 4 V can be used, resulting in a higher load resistance Rl and a narrower 
transistor. The gain in the transistor size is not enough to compensate the increase 
in the input capacitance per unity width. Consequently, the power necessary to 
drive this output stage will increase slightly. Nevertheless the current flowing in 
the circuit is significantly lower. 




1% 



More complex technologies are required to lower the input capacitance and as 
such profit from a high supply voltage. Minimum process changes in CMOS [17] 
or the use of more advanced technologies [24,26] do allow a reduction in the input 
capacitance. 



5.3. Increasing Efficiency 

In Section 5. 1 . we have seen that the drain efficiency of the class E power amplifier 
is much more dependent on the inductor quality factor than on the switch on- 
resistance. In fact, simulation results have shown that under certain circumstances 
overall drain efficiency can be considered independent of the switch on-resistance. 
Being the size reduction the only benefit from a higher supply voltage linked with 
a lower current in the circuit. 

The simulations in Section 5.1. were done using two different values for the 
switch on-resistance. But they can just as well represent the same device, being 
one narrower than the other. The consequence is that in a multi-stage amplifier 
the driving-stage will now have a smaller load. This in turn originates a smaller 
driving stage. The consequence is a power amplifier having smaller transistors 
and thus saving silicon area. Another advantage is the reduction in the power 
necessary to drive the two stage amplifier. 

As an example consider a two stage amplifier with the goals as described in 
Section 5.1. but only for the case of 2 l= 40. After including all relevant para- 
sitics to the netlist to better represent the silicon implementation and replacing 
the switch by an accurate representation of a CMOS transistor, the circuit was 
optimized. 

• A global optimization of the two amplifier gives a normalized output stage 
transistor width equal to W\ 

• The above value can be reduced without a significative change in the DE. 
This is possible if capacitor C\ is allowed to increase. It can be done by 
increasing the drain-bulk capacitance or by using an extra capacitor. Setting 
Wi=Wil2 and performing another optimization of the amplifier confirms this 
assumption 

• A further reduction in W\ is still possible if capacitor C\ continues to increase. 
The result is an input and output transistor size considerably smaller than 
those available in open literature 

The trade-off the designer must do when considering the above approach is 
that every reduction be a factor of two, increase the RMS current in the circuit by 
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about 10 %. An option that has to be carefully weighted due to electromigration. 
Although in the previous experiment the drain efficiency in the optimized ampli- 
fier is kept almost unaltered, the power added efficiency increases due to a smaller 
input capacitance of the driving stage. 

6. Conclusion 

The current trend of advanced CMOS processes having smaller design features, 
results in a consequent reduction in the power supply. This makes obtaining the 
necessary output power is and will become a difficult task. This is especially 
true if an output power over 30 dBm is required into 0.18 or 0.25 /xm CMOS 
technologies. 

The maximum drain efficiency (DE) of a class E power amplifier is independent 
of the supply voltage. Nevertheless, an increase in the power added efficiency 
(PAE) is possible if the transistor size is reduced. 

To increase the efficiency, inductors with a high quality factor and transistors 
with small input capacitance are a must. The previous factors are more important 
than having a transistor with a lower on-resistance. 

Some of the advantages of a high power supply include: (a) lower current and 
lower losses in the interconnections, (b) smaller transistors, and (c) the possibility 
to have a higher output power. 

High voltage is useful usually at the cost of more complex -and more expen- 
sive, technologies. Power driving circuits do not benefit from the current trends 
towards deep sub-micron and the resulting reduction of the maximum voltage of 
operation. High voltage devices for RF power amplifiers are an advantage as long 
as the input capacitance is comparable or lower than its low-voltage counterpart. 
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Abstract 

This paper focuses on the power management solutions for desktop 
and mobile CPUs, with special emphasis on the control aspects and 
the related control IC requirements and architectures for VRM 
applications. The paper discusses the following topics: power 
converter topologies, load transient response, the ADOPT™ voltage 
positioning technology, maintaining high efficiency over a wide load- 
current range, current and thermal balancing, current-sense solutions, 
protection functions, functional block schematics of VRM control ICs, 
and challenges for the IC designers. 

1. Introduction 

Designing a power supply (often called VRM for Voltage Regulator Module) for 
CPUs (Central Processor Units, or microprocessors) for desktop and mobile 
applications is a challenging task. The microprocessors are highly dynamic loads, 
switching between the minimum and maximum currents with a di/dt that can exceed 
450A/ps at the processor socket [1]. The supply voltages (at the time of writing this 
article) typically fall in the 0.8 to 1.6-V range, determined by a five-to-eight bit VID 
(Voltage Identification) code [2] coming from the microprocessor. (Note: This means 
that the VRM controller has to include a five-to-eight bit DAC.) Dynamically 
programmed lower voltages down to about 0.5 V also begin to appear, especially in 
battery-powered mobile applications, where energy conservation is extremely 
important. Depending on the processor type, the current consumption in high-current 
active mode varies over a wide range, and can exceed 100 A for high-performance 
microprocessors. Further design difficulties arise from the requirement that the CPU 
supply voltage must be a tightly controlled linear function of the load current. This 
means that the power supply must have a specified resistive output impedance, from 
dc to several hundred kHz. Additional difficult-to-meet requirements include high 
voltage step-down ratio, high efficiency down to a fraction of a percentage of the full 
load (especially in mobile applications), differential remote sense, processor power 
sequencing, fast on-the-fly VID programming, and protection against sustained 
overload, overtemperature, output overvoltage, and output reverse voltage. It also 
goes without saying that the cost and size of the power supply should be minimized. 
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This paper presents an overview of the most widely employed power-converter 
topologies for CPU VRM applications, discusses the most important control and 
protection issues with relevance on the architecture of the PWM control ICs, and 
presents the functional block diagrams of two example control ICs for 
microprocessor VRMs. 



2. Power-converter topologies 

Most CPU power supplies are based on the synchronous buck converter (Fig. 1). That 
circuit is capable of efficiently converting the voltage of the primary power source to 
the low output voltage demanded by the microprocessor, with the simplest possible 
topology. (The primary power source is either the +12V or +5V output of the “silver 
box” - the line power supply in desktop computers -, or the battery in mobile 
applications. The battery voltage is typically between 8 and 19V including battery 
charger operation.) 




Fig. 1. The synchronous buck converter. 



The basic synchronous buck converter has several limitations: 

• It has a large input ripple current - this increases the cost of the input filter. 

• Due to the limited rate of change of current in the buck inductor, the load transient 
response will be sluggish - this increases the cost of the output filter capacitor. 

• The output current capability is limited by device availability. 

• The converter has a high hot-spot temperature due to the concentrated heat 
dissipation. 

The output current capability can be increased and the hot-spot temperature can be 
reduced either by paralleling power switches or by paralleling identical converters. 
The load transient response can be improved by reducing the inductance of the buck 
inductor, but that will lead to increased conduction losses and increased turn-off 
losses of the upper (control) FET. The conduction-loss increase caused by reduced 
inductance can be prevented by increasing the switching frequency, but that will lead 
to an increase in the switching losses. There is, however, a technique that can 
overcome all above limitations of the basic synchronous buck converter. That 
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technique combines the idea of paralleling converters with an even distribution of the 
switching instants of the individual converters over the switching period. This 
concept is called multiphase interleaved conversion, and it has been around since at 
least the early 70s [3]. Fig. 2 shows the topology of a three-phase interleaved 
synchronous buck converter. 




Perhaps the most important advantage of the multiphase converter is that the 
equivalent switching frequency increases in proportion to the number of phases, 
without increasing the switching losses. The higher equivalent switching frequency 
means that the equivalent inductance can be decreased without an efficiency penalty. 
Since the inductance is the major limiting factor for a fast load transient response, a 
multiphase converter is expected to produce a better response than its single-phase 
counterpart operating with the same switching frequency. 

Unfortunately, using a multiphase topology is not without its drawbacks. Those 
drawbacks are: 

• More FETs, drivers, and output inductors are needed. (Note, however, that the 
total FET size and inductor volume remain about the same.) 

• The control circuitry becomes more complex. 

• With voltage-mode control the current sharing among the phases is not 
guaranteed. 

• With current-mode control, more than one current sensor might be required. 

Although at present the single-phase or multiphase synchronous buck converter is by 
far the most popular choice for desktop and notebook computers, in multiprocessor 
server applications, where the total processor current can exceed several hundred 
amperes, often transformer-coupled converters are used. The main motivation for 
transformer coupling is to reduce distribution losses and parasitic voltage drops. The 
transformer coupling allows the input voltage for the converters to be raised to a 
higher level (typically 48V). Stepping down such a high voltage to the low voltage 
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required by the microprocessor with a buck converter is impractical due to the 
extremely narrow duty ratio and the related high switching and conduction losses. 
Ref. [4] presents a critical evaluation of six transformer-coupled topologies for such 
applications. Other recommendations for implementing high step-down ratios include 
the usage of high-frequency ac power distribution [5], multiphase tapped-inductor 
buck converters with active clamp [6], and cascade connection of the transformer- 
coupled active-clamped inductorless forward converter and the multiphase 
synchronous buck converter [7]. 

3. The multiphase synchronous buck converter 
3.1. Input and output ripple currents 

A major benefit of the multiphase converters is the fact that both the input and output 
ripple currents are reduced. Fig. 3 shows the rms values of the ac input currents, for 
single-phase, two-phase, three-phase, and four-phase interleaved buck converters vs. 
the duty ratio of the high-side FET, for the case when the ripple currents in the 
inductors are negligible. (The base of normalization is the output current.) It can be 
seen that the rms current in the input filter capacitor of the multiphase converter is 
much smaller than that of the single-phase converter. Also, as the plots in Fig. 4 
indicate, the ripple current flowing in the output filter capacitor is substantially 
reduced. That figure shows the normalized ripple currents of the two-phase, three- 
phase and four-phase buck converter, with the ripple current of the single-phase 
converter used as the basis of normalization. It is interesting to note that both the 
input and output ripple currents become zero at the duty ratios M/N, where M is an 
integer number between 1 and N-1 and N is the number of phases. 




Fig. 3. Normalized rms ac input current of the multiphase buck converter 
v^. the duty ratio (base of normalization: output current). 
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Fig. 4. Normalized output ripple current of the multiphase buck converter v^. the duty 
ratio (base of normalization: output ripple current of the single-phase converter). 



3.2. Theoretical load transient response limits 



The main dynamic requirement for a converter powering a microprocessor is that it 
responds rapidly to step changes in the load current. In this respect, a multiphase 
converter is essentially equivalent to its single-phase version operating at N times 
higher switching frequency and having an inductor with an inductance that is 1/N 
times smaller. 



Load current 



Inductor current 




I 



Capacitor current 




T 

1 




Fig. 5. Waveforms for determining the theoretical load transient response limits. (Vi 
is the maximum voltage available to change the current in the inductor.) 
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The theoretical load transient response limit can be understood by considering the 
waveforms in Fig. 5. The assumptions are that (1) the only parasitic component to 
consider is the equivalent series resistance (ESR) of the output capacitor, and (2) the 
ripple component of the inductor current is negligible. (Note: While the second 
assumption is not always true for a single-phase converter, it is a good approximation 
for multiphase converters, where the individual inductor ripple currents tend to cancel 
each other.) 

It is straightforward to calculate the voltage deviation vs. time, for a step change in 
the load current and for a control that forces the inductor current to change towards 
the new steady-state value as fast as possible [7]. The result is: 

Av„(t) = t^^ + t[^mR,-^j-R,AI (1) 

where C is the capacitance of the output capacitor. Re is the ESR of the output 
capacitor, AI is the amplitude of the load step change, and m is the slope of the 
inductor current. In the case of an upward step in the load current 



m = 




( 2 ) 



and in the case of a downward step in the load current. 




( 3 ) 



where L is the equivalent inductance of the converter, as defined at the beginning of 
this section. 



From (1) the peak deviation is either 

AV = R,AI (4) 



or 



AV = 



AI" mCR" 

+ ^ 

2mC 2 



( 5 ) 



depending on the time constant ReC of the output capacitor. When that time constant 
is larger than lent = Al/m, (4) gives the peak deviation and when the time constant is 
smaller than Xcrit, (5) gives the peak deviation. Due to the fact that the rising slope 
tends to be much smaller than the falling slope, it is possible that the peak deviation 
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for a upward load step is determined by (4) and for the downward load step it is 
determined by (5). 

The significance of the capacitor time constant is that when it is relatively large (as in 
the case of electrolytic capacitors), the peak deviation is determined only by the ESR, 
and does not depend on the inductor current slope. This means that compliance with 
the load transient specifications can be achieved even when the slopes are small, i.e. 
when the switching frequency is relatively low (and the inductance is large), using an 
output bulk capacitor whose size and cost are essentially independent from the 
frequency. On the other hand, when the capacitor time constant is small (as in the 
case of multilayer ceramic capacitors, or MLCCs), the peak deviation depends on the 
slope, and so there is a strong inverse correlation between the capacitor size/cost and 
the switching frequency. This means that MLCCs as bulk output capacitors can only 
be used cost-effectively at high switching frequencies. 

3.3. Duty-ratio considerations for multiphase converters 




Fig. 6. N-phase buck converter with a single pulse-width modulator 
(for non-overlapping conduction of the control FETs). 

Fig. 6 shows a typical control architecture for a multiphase interleaved buck 
converter. In that architecture there is a single pulse-width modulator (PWM), whose 
output pulses are sequentially sent to the control FETs of the individual phases. Fig. 7 
shows another architecture, where there are as many PWMs as phases, and each 
PWM output is sent only to one phase. In this case the PWM carrier waveforms 
(usually sawtooth waves) are shifted from each other by 360®/N. The controller for 
the architecture in Fig. 6 is the less expensive one of the two versions due to its 
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simplicity (only one PWM and one PWM carrier waveform generator). Another 
advantage of that architecture is that in steady state the duty ratios of the control 
pulses for all phases are equal, while in the architecture of Fig. 7 the inevitable small 
differences in the PWM carrier waveforms lead to duty-ratio differences. The duty- 
ratio differences, in turn, can lead to unequal current sharing among the phases. The 
drawbacks of the architecture of Fig. 6 are: (1) It does not allow the maximum duty 
ratio to exceed 1/N, so in applications with low input voltage the number of usable 
phases is limited. For example, with 5V input and 1.5 V output, where the steady-state 
duty ratio is close to 0.3, the maximum number of phases is limited to three. (2) Since 
overlapping conduction of the control FETs is not allowed, the maximum available 
slope of the inductor current for upward load steps is reduced, and actually becomes 
less than the available slope for downward load steps in the input voltage range from 
NVout to 2NVouf In practice, this means that if the input voltage drops below 2NVom 
(e.g. below 9V for a three-phase configuration with 1.5V output), the cost of the 
output capacitors to meet the load transient specifications will be higher than for the 
architecture of Fig. 7. 




Fig. 7. N-phase buck converter with N pulse-width modulators 
(for overlapping conduction of the control FETs). 

4. Control and protection considerations 

4.1. PWM control techniques 

The fundamental task of the controller for a CPU VRM is to ensure the regulation of 
the output voltage. Voltage regulation in a buck converter is implemented with duty- 
ratio control. There are many different types of duty-ratio modulators (often called 
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Fig. 8. Hierarchical chart of 
commonly used single-loop PWM 
control techniques. 



Fig. 9. Hierarchical chart of commonly used 
multi-loop PWM control techniques. 
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pulse-width modulator or PWM), including constant-frequency and variable- 
frequency types, with single-loop or multiple-loop feedback/feedforward control, and 
with or without voltage-error amplifier. Figs. 8 and 9 show the hierarchical charts of 
the various types used in, or considered for, CPU buck converters. Many of those 
control techniques, including the single-loop control techniques with voltage-error 
amplifier, the hysteretic controller without error amplifier, and the various current- 
mode controllers, are described in [9]. The Vsquare control is described in [10]. 

Recently, the idea of digital control of VRMs has received increased attention [11]- 
[14], and at least one vendor released a chipset with digital control for such 
applications [15]. Whether digital control of VRMs is a more cost-effective way of 
achieving compliance with the required specifications than analog control still 
remains to be seen. 

4.2. Implementing the load line or voltage positioning 

The microprocessor specifications require that the output voltage be a well-defined 
linearly decreasing fimction of the load current (see, e.g. Figs. 1 and 2 of [1], or 
Section 2.2 of [2]). This requirement is equivalent to having a resistive output 
impedance over a wide frequency range. The resistive output impedance allows the 
output voltage to decrease, or droop, in proportion to the instantaneous processor 
current. The droop is often called “voltage positioning,” due to the fact that heavy 
load positions the output voltage near the lowest voltage where the CPU is still 
operational, and light load positions the output voltage near the maximum voltage 
acceptable for the CPU. The optimized voltage positioning (the one where the bulk 
capacitor is selected to have an ESR to be equal to the resistance represented by the 
load line) provides two major advantages: (1) It allows the minimization of the bulk 
output capacitor by essentially reducing the maximum peak-to-peak voltage deviation 
to be equal to the product of the maximum load step magnitude and the ESR of the 
bulk capacitor, at any pulse width of the load current step. (2) By allowing the output 
voltage to sag to its specified minimum value at full load reduces the average 
dissipation in the microprocessor. 

There are several ways of implementing voltage positioning. The first, widely used, 
solution was adding a “droop” resistor with a resistance equal to the value 
represented by the required load line between the inductor and the bulk capacitor, and 
taking the voltage-regulating feedback from the junction of the inductor and the 
resistor (Fig. 10). Unfortunately this, traditional, approach has several drawbacks, 
including the cost of the high-current precision resistor, reduced efficiency, and most 
importantly the fact that the transient response to a step load change shows a spike- 
back, which, in case of narrow load pulses, prevents the output voltage to remain 
between the upper and lower tolerance limits of the load line. (This issue will be 
discussed in more detail in the next subsection.) 
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Fig. 10. Traditional implementation of the voltage positioning using a resistor 
between the inductor and the bulk capacitor. 




Fig. 11. Implementing ADOPT^^ voltage positioning in a current-mode controlled 

converter. 

The next three methods (collectively called ADOPT^^, for Analog Devices’ Optimal 
Positioning Technology) are capable of alleviating or completely avoiding the 
drawbacks of the droop resistor technique. They are implemented by using either (1) 
a current-mode-controlled converter with an optimized compensation of the voltage- 
error amplifier (Fig. 11, [8], [16], [18]), or (2) a converter with fast voltage regulation 
(e.g. a hysteretic regulator) complemented with a current sensor and an external RC 
network between the output, the reference voltage and the noninverting input of the 
PWM controller (Fig. 12, [16], [19], [25]), or (3) a voltage-mode-controlled 
converter, in which the total inductor current signal is subtracted from the reference 
voltage and the voltage-error amplifier is optimally compensated (Fig. 13, [17], [20]). 
All three ADOPT'^^ methods allow setting the output impedance of the converter to 
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be equal to the ESR of the bulk capacitor over a wide frequency range, thereby avoid 
the spike-back and permit the minimization of the size and cost of that capacitor. 
These methods can also be used in the cases where the ESR is smaller than the 
specified load-line resistance or where a substantial amount of low-ESR capacitors 
(typically multilayer ceramic capacitors for local bypassing around the 
microprocessor) is placed in parallel with the bulk capacitor. 




Fig. 12. Implementing ADOPT^^ voltage positioning in a converter with hysteretic 

regulation. 




Fig. 13. Implementing AD OPT^^ voltage positioning in a voltage-mode controlled 

converter. 
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4.3. Load transient response 

The reason why the traditional voltage positioning technique discussed at the 
beginning of the previous subsection does not work well with narrow load pulses can 
be understood with the help of Figs. 14 and 15. Fig. 14 shows the details of the spike- 
back for the case of load-current step-up. Initially the voltage drop is equal to the 
product of the current step Alout and the ESR of the bulk capacitor, but then the 
voltage-regulating feedback loop attempts to bring back the voltage rapidly to the 
value corresponding to the product of Alout and the parallel equivalent of the ESR, Re, 
and the droop resistance, Ryp. The speed of the voltage rise is limited by the available 
rate of inductor current change (di/dt), so the peak of the spike-back voltage is 
delayed by a time interval that depends on di/dt. Eventually the output voltage will 
settle at the new value determined by the product of Alout and Ryp. 




Fig, 14. Details of the spike-back waveform. 





Fig. 15. Waveforms with traditional voltage positioning. 
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Fig. 15 shows what happens when the load current pulse is narrow and the peak or 
valley of the spike-back coincides with the termination of the pulse. Clearly, 
overshoot and/or undershoot appear, which increase the peak-to-peak deviation. The 
worst-case deviation can be almost twice the product of the current step amplitude 
and the voltage positioning resistance. 

In the case of ADOPT^^, the load transient response is an almost ideal step, as can be 
seen in Fig. 16, which shows the measured load-current step response of a CPU 
converter on two different time scales. 
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Fig. 16. Measured output voltage response of an ADOP'H^ converter to a 14A step 
change in the load current. Top time scale: 10 jus/div, bottom time scale: 100 jus/div. 

Voltage scale: 20 mV/div. 



4.4. Maintaining high efficiency over a wide range of load-current variation 

In mobile applications it is important to maximize the battery lifetime. This effort is 
helped by the improved power management of the newer generations of mobile 
microprocessors (IMVP, QuickStart, DeepSleep, Enhanced SpeedStep, Thermal 
Throttling, see, e.g. [21]-[23]). It is also important that the losses in the various power 
supplies of the computer (including the VRM for the CPU) be minimized. Those 
losses are around 10% of the total power consumption of the computer. Since in 
battery-powered operation the microprocessor spends most of the time in low-power 
mode, it is important for the VRM to maintain high efficiency down to a fraction of 
one percent of the full load. In addition to optimized design of the power section of 
the converter, the high efficiency at light load can be achieved by a combination of 
essentially two types of power-saving measures, (1) decreasing the switching losses, 
which tend to be dominant at light loads, and (2) eliminating the circulating current 
caused by the reverse conduction of the low-side, synchronous rectifier, FET when 
the load current drops below the value of critical conduction (i.e., when the inductor 
valley current reaches zero). 
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Above the load current corresponding to critical conduction, the switching-loss 
reduction can be achieved by phase shedding. Phase shedding means that out of the N 
phases of the multiphase converter one or more phases are turned off whenever the 
load current drops below a certain level. The drawback of phase shedding is that the 
output ripple voltage increases. 

If the load current drops below the critical conduction, the losses caused by the 
circulating current can be reduced by turning off the synchronous rectifier in the 
instant when the inductor current reaches zero. In this case the synchronous rectifier 
essentially emulates a true diode, i.e. it does not allow the current in it to reverse (see 
Fig. 17). With diode emulation below the load current corresponding to critical 
conduction, the converter operates in discontinuous inductor-current mode (DCM), 
and the rms currents in the inductor and also in the power FETs are reduced, and so 
the conduction losses are cut back. 




(a) synchronous operation: the low-side FET is always on 
^ when the control FET is off 





(b) asynchronous operation: the low-side FET emulates diode 
Fig. 1 7. Inductor current waveforms at light load. 

In DCM the switching losses can be reduced by frequency modulation, without the 
penalty of increasing output ripple. By making the switching frequency proportional 
to the load current, the efficiency can be made essentially independent from the load 
current, at least until the bias power consumptions of the controller and driver ICs 
become comparable to the output power. 

Fig. 18 shows the calculated efficiency of a typical 300kHz four-phase converter with 
diode emulation, in four-phase operation and in single-phase operation (i.e. after 
three phases were shed). Fig. 19 shows the calculated efficiency of a typical 300kHz 
single-phase converter in three cases, (1) without diode emulation, (2) with diode 
emulation but without frequency modulation in DCM, and (3) with both diode 
emulation and frequency modulation in DCM. In the last case the switching 
frequency is proportional to the load current. 
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Fig. 18. Calculated efficiency of a 300kHz 
four-phase converter with diode 
emulation, in four-phase operation and 
after three phases were shed, (i.e. in 
single-phase operation). 
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Fig. 19. Calculated efficiency of a 
300kHz single-phase converter without 
diode emulation, with diode emulation, 
and with diode emulation and frequency 
modulation in DCM (fsw ~ hut)- 



4.5. Current/thermal balance in multiphase converters 

Imbalanced operation in multiphase converters can manifest itself in uneven current 
sharing and/or in uneven junction temperatures of the power FETs. The causes of 
imbalances can be (a) duty-ratio differences caused by either unequal PWM sawtooth 
amplitudes or uneven propagation delays, (b) differences in the channel resistances of 
the power FETs, or (c) thermal-resistance differences. Duty-ratio differences lead to 
unequal current sharing among the phases. Employing current-mode control or 
adding the current information to the PWM sawtooth in voltage-mode control are 
effective ways to balance the currents. The problem with these approaches is, 
however, that if there is a difference in the Rds(on) values or in the thermal resistances 
among the phases, then current balancing will exacerbate the temperature differences. 
The temperature differences caused by the Rds(on) or thermal-resistance differences 
can be reduced by allowing the positive temperature coefficients of the FET channel 
resistances (about 0.4%/°C) equalize them. The higher the temperature of a FET is, 
the higher its resistance will be, which will reduce the current flowing in it. The 
optimal solution seems to combine the two balancing techniques, i.e. to trade some 
current balance for thermal balance by using voltage-mode control with a small 
amount of current signal added to the PWM sawtooth. The best thermal balance is 
obtained if the voltage drops across the conducting FETs are used as current signals. 
The drawback of a good thermal balance is, however, that it forces uneven phase 
currents, therefore the inductor sizes must be increased in order to be able to handle 
the highest possible current without entering in saturation. 



4.6. Current-sense solutions 



The load-line specification requires the controller to receive accurate information 
about the total inductor current. A straightforward way of generating the current 
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signal is to use a current-sense resistor between the common junction of the inductors 
and the output bulk capacitor. The drawbacks of that solution are cost, extra 
dissipation, and layout difficulties. A more practical approach is to use the winding 
resistances of the inductors. The current signal information can be extracted by using 
an RC filter across the inductor with a time constant that matches the L/Rwinding time 
constant of the inductor [24]. Fig. 20 shows the implementation of the idea in a two- 
phase converter. Unfortunately, sensing the voltage drop across the winding 
resistances introduces a positive temperature coefficient, and this makes it difficult to 
meet the accuracy requirements. Fig. 21 shows an economical way of canceling the 
temperature coefficient of the winding resistances with a single NTC resistor in the 
feedback network of the current-sense amplifier [17], [20]. 




Fig. 20. Sensing the total inductor current in a two-phase converter, using the 
winding resistances of the inductors. Two RC circuits with a shared C recover the 

total inductor current signal. 

4.7. Optimal tri-loop control for multiphase converters 

Fig. 22 shows a tri-loop control architecture for multiphase voltage-mode converters 
[17]. The three control loops are (1) a common feedback loop for output voltage 
regulation, (2) individual phase current loops for current/thermal balancing, and (3) a 
common impedance loop to implement the required load line. 
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Fig. 21. Sensing the total inductor current in a two-phase converter, using the 
winding resistances of the inductors. A single NTC resistor cancels the temperature 
dependence of the winding resistances. 

The voltage-regulating feedback loop has high dc gain for accurate tracking of the 
difference between the reference voltage and the total inductor current signal, and is 
compensated for ADOPT™ load transient response according to the equations in Fig. 
13. The phase current loops are using the voltage drops across the synchronous 
rectifier FETs, thus providing the best trade-off between current and thermal balance. 
The common impedance loop is implemented by subtracting the total inductor current 
signal from the reference voltage. That current signal can be generated either by a 
sense resistor between the junction of the inductors and the bulk capacitor or by using 
the winding resistances of the inductors, as per Fig. 21. The tri-loop control of Fig. 22 
represents the optimal solution for multiphase voltage-mode converters for VRM 
applications, and is implemented in the ADP3168 IC from Analog Devices. 

4.8. Protection considerations 

It is expected that the VRM is (a) protected against sustained overload at the output, 
and (b) designed such that under no circumstances does an overvoltage or reverse 
voltage appear on the output. 
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Fig. 22. Optimal tri-loop control of multiphase converters in VRM applications. 



The overload protection can be continuous (cycle-by-cycle) type, hiccup-type, or 
latch-off type. Each type has its set of advantages and disadvantages. The continuous 
protection provides safe start-up with any output capacitance, and if the overload is 
removed the output recovers without the need for recycling the input supply. On the 
other hand, the continuous operation during sustained overload leads to hot spots and 
a complete battery discharge. Also, the VRM can overheat if the output current limit 
is not well controlled. (Note: Foldback current limit helps preventing the 
overheating.) The hiccup protection reduces the average current by the hiccup on/off 
ratio, so hot spots and overheating are avoided. Also, if the overload is removed the 
output recovers without recycling the input supply. The drawbacks of the hiccup 
protection include the danger of start-up difficulty with large output capacitances and 
the additional complexity. The latch-off type protection completely eliminates the 
power consumption after the latch-off, but can fail to start with a large output 
capacitance; also, it does not provide automatic recovery after the overload is 
removed. 

The most likely causes of an output overvoltage are sudden load removal (note: this 
should never happen in a VRM that complies with the voltage regulation 
specifications) or a drain-source short of the high-side FET. The VRM specifications 
expect that if the output is above the maximum VID level by a certain margin 
(usually 200 mV), or above a set absolute voltage specified for the CPU, the VRM 
shuts off the supply to the processor. This should be done by an output overvoltage 
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protection (OVP) circuit with a path separate from the voltage sense path used for 
voltage regulation. The concern here is to avoid false tripping due to external noise or 
an inaccurate threshold. Also, the protection circuit should enable a low-resistance 
path (a crowbar) to ground with a sufficiently fast response time such that if the high- 
side FET shorts to the input power source then the output voltage will not exceed the 
maximum specified voltage for the processor. The crowbar is usually the low-side, 
synchronous rectifier, FET. When the overvoltage happens due to the failure of the 
high-side FET, the crowbar is turned on by the OVP circuit and blows a fuse, A 
potential problem is that if the fuse is replaced without replacing the high-side FET 
and the system is turned on again, the supply voltage for the control IC comes up 
slowly and the processor sees the full input voltage through the shorted high-side 
FET before the crowbar could activate again, and this destroys the processor. A fail- 
safe solution is to use a dedicated crowbar FET directly across the output that runs 
from an “always-on” rail and is turned on by default when the system comes up. The 
dedicated crowbar FET is also effective against reverse voltage caused by the current 
reversal in the inductor when the low-side FET is shorted. In that case the reverse 
current leads to a voltage reversal across the output capacitor and possibly to the 
destruction of the processor. Fig. 23 shows the combined implementation of 
overvoltage and reverse-voltage protection with a dedicated crowbar FET in a single- 
phase converter with hysteretic ripple-current control and ADOPT™. 

Ml 




Fig. 23. Combined implementation of overvoltage and reverse-voltage protection 
with a dedicated output crowbar FET. 
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5. Example control IC architectures 




Fig. 24 shows the functional block schematic of a dc-dc controller IC, the ADP3205 
from Analog Devices [25] developed for notebook computer applications. The device 
utilizes hysteretic ripple current control, which is a combination of the hysteretic 
regulator principle with ADOPT™ voltage positioning. Its features include the 
following: 

• Pin-programmable one-, two-, or three-phase operation 

• Compliance with Intel IMVP-IV specifications 

• Inherent static and dynamic current sharing 

• ADOPT™ voltage positioning 

• Noise blanking after a leading edge to avoid false switching due to noise pickup 

• Diode-emulation control of the low-side FET for optimized light-load efficiency 

• Six-bit VIDDAC 

• Soft DAC output voltage transition for dynamic VID change 

• Masked Power Good during output voltage transients 

• Cycle-by-cycle current limiting with latched or hiccup overload protection 

• Soft start-up 

• Two-level overvoltage and reverse- voltage protection 
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Fig. 25. Functional block diagram of the ADP3168 IC. 

Fig. 25 shows the functional block schematic of the ADP3168, also from Analog 
Devices [20]. The device was developed for desktop dc-dc converter applications, 
and utilizes the optimal tri-loop control architecture discussed in Section 4.7. Its 
features include the following: 

• Pin-programmable two-, three-, or four-phase operation 

• Compliance with Intel VRD-10 specifications 

• IMHz per phase switching frequency 

• Active current/temperature balancing 

• Temperature-compensated current sensing, using the inductor winding resistances 
as sense resistors 
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• ADOPT™ voltage positioning 

• Six-bit VID DAC 

• Built-in Power-Good and crowbar blanking to support on-the-fly VID code 
changes 

• Programmable overload protection with programmable latch-off delay 

6. Challenges for the designers of VRM control ICs 

The design of control ICs for VRM applications is a challenging task. The challenges 

include, but are not restricted to, the following: 

• Low cost and extremely short development schedule 

• High-accuracy band-gap reference [less than ±1% error over wide temperature 
range (0 to 100 C) and over process variations], with good rejection of the input 
ripple voltage (40 dB or better) at the effective switching frequency, and with 
floating ground arrangement to comply with the remote sensing requirements 

• VID DAC with a resolution up to eight bits, with monotonicity and good linearity, 
and with controlled slew rate to comply with VID on-the-fly specification 

• Voltage-error and current-sense amplifiers with low offset (less than ±1-2 mV) 

• In mobile applications: Low supply voltage (3.3V nominal), with full functionality 
down to around 2.7 V 

• Complex system interface (PowerGood signal, power-saving control features, 
VID on-the-fly control, soft transition between active and power-saving modes for 
low acoustic noise) 



7. Summary 

This paper discussed the converters used for notebook and desktop CPU VRM 
applications, with emphasis on the control aspects having relevance on the 
architecture of PWM control ICs. The discussion included the review of power 
converter topologies, some of the important characteristics of the multiphase buck 
converter, the various PWM control techniques, the implementation of the load-line 
specification, the load transient response and the related ADOPT™ voltage 
positioning technology, the techniques to maintain high efficiency over a wide range 
of load-current variation, current and thermal balancing, current-sense solutions, and 
protection against overload, output overvoltage and output reverse voltage events. 
The paper concluded with the presentation of the functional block schematics of two 
control ICs for microprocessor VRMs and with a summary of control IC design 
challenges. 
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Abstract 

RF circuits have very specific requirements that they impose on 
the power supply. This paper discusses these requirements. The 
different ways of generating such power supply in a suited way 
for RF circuits is also discussed. A large section of this paper is 
taken up by an example RF circuit that has ultra-low power and 
miniature size requirements. This makes that this circuit needs to 
run directly from a single cell battery. The large impact his has on 
design choices is discussed. 



1. Introduction 

A little more than one decade ago wireless communieation did start its strong 
uprise. The cellular mobile phone system GSM is of course the clearest example 
of this, but since then many other wireless applications and standards, such as 
WLAN and BlueTooth, have emerged. In all these applications battery lifetime 
is a most critical property for the mobile stations and a search for its constant 
improvement is always ongoing. The power profile of a wireless circuit or 
system is however much more than the sum of the active currents of all its 
subcircuits multiplied with the batteries voltage level. The first part of this paper 
will discuss the different issues involved with the selection of the power supply 
voltage for the different parts in wireless system. The second part describes the 
different circuits techniques that can bridge the difference between the battery 
properties and the power supply needs. The following section goes briefly into 
the dynamic behavior of the power consumption in a wireless system. Finally, 
an example of a wireless system is presented that requires an ultra low power 
consumption in a minimum space. One of the consequences of these extreme 
requirements is that this RF chip does need to run directly from the battery. The 
impact of this on the power profile of the chip is discussed. 
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2. Choosing the power supply voltage 

The properties of analog and RF circuits will vary widely with a variable power 
supply voltage. In fact, this is maybe even more so for digital circuits, where the 
power consumption and speed performance roughly vary with the square of the 
power supply voltage. The difference will be however that for some analog 
circuits you may see degradation in both directions when deviating from the 
optimum power supply level. The result is that in general a good quality RF 
circuit cannot run directly from the battery. A voltage regulator will be needed 
that keeps the power supply constant within a small range. This section will 
discuss the issues related to power supply regulation and the optimum choice of 
the power supply level. 



2.1. Battery properties 

Fig. 1 illustrates the large variations in voltage level over the battery life time 
that can be expected [1]. These example discharge curves are given for a 
number of the most common used battery types for wireless applications. 
Although these curves show a rather flat response for most of the time, the 
designer has to take into the peak voltage that will be present at start. There is 
some more flexibility on which point will be defined as fully discharged, but 
even so the ratio between the highest and lowest possible voltage level can be a 
factor 1.5 to almost 2. This is an enormous variation for analog and RF circuits 
to cope with. 




Fig.L Example discharge curves of commonly used battery types. 
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An equally important aspect that will prevent direct operation from the battery 
is its internal resistance. The battery’s internal resistance makes that variable 
currents that are drawn by the wireless device will be converted in a ripple on 
the battery level. The consequence is that very strong power supply rejection 
ratio requirements need to be imposed on those circuits that would operate 
directly from the battery. 



2.2. Circuit requirements 

Different RF circuits have different power supply needs. A specific example is 
for instance an LC VCO. Although the VCO’s varicap uses preferable a rather 
large voltage range in order to keep its sensitivity low, the VCO core itself is a 
block that does not require a large power supply voltage [2]. Fig. 2 shows the 
basic outline of a LC VCO circuit. As power supply voltage it requires only the 
Vgs values for the cross-coupled negative gm plus the Vosat of the current source. 
Large signal swings will be generated in the oscillator, but this has no influence 
on the power level requirement. The consequence is that in today’s mainstream 
technologies an LC VCO will only require a 1 V to 1.5 V power supply. 
Anything in excess of that will result in a loss of power in its current source. As 
an alternative, LC VCO topologies can be used that take into account the 
available voltage headroom by placing both and nMOS and pMOS negative gm 
pair [3], [4]. The pMOS pair will however not achieve the same efficiency for 
the power supply drop it uses as the nMOS pair. 




Fig,2. Basic topology of a differential LC oscillator. 
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A similar story goes for the LNA. An LNA is basically designed to pick up very 
small signals and amplily these. Even after amplification its output swing will 
remain limited even for the highest input signal level that must be tolerated. 
Especially when an inductive load is used, a 1 to 2 V power supply may be 
sufficient for the LNA [5]. In some cases this property has been used to save 
power by stacking different stages on top of each other. Fig. 3 gives an example 
of an LNA circuit that uses this property. A source follower buffers the load 
capacitance in this topology such that a high bandwidth can be obtained, even 
without inductive load. 




Fig. 3. LNA topology that re-uses the current. 

A downconversion mixer and its succeeding IF filters have a different 
dependency on the power supply. In this case it is the achievable signal swing 
that must be maximized and at low frequencies this implies using the largest 
possible power supply level [6]. A larger signal swing will allow for equally 
larger noise levels in these blocks and thus directly for a lower area and power 
consumption. Without going into much detail on specific types of circuits in RF 
chips, it is clear that different part may have different power supply 
requirements. 
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3. Generating the power supply for RF circuits 

Most wireless circuits will require some extra circuitry to manage the power 
supply level [1],[7]. The main function of such circuit will be to regulate the 
voltage that is provided to the wireless chip such that is kept within small 
boundaries. Maximum variations of ± 5 % or ± 10 % are common. A second 
function of a power management circuit is making sure that the power supply 
signal is a clean signal. This means it needs to have a low internal impedance 
over a wide frequency range such that current variations are not translated into 
voltage variations. It also means removing noise that is present on the 
unregulated power supply level. The use of different regulators for different 
parts of the system allows for an isolation between them. 



3.1. Linear regulators 

Linear regulators are the most easy to implement type of regulators. In many 
cases, depending on the current they have been designed for, they can be 
implemented onto the same chip as the RF circuitry, which makes them also the 
cheapest implementation of a regulator. 

There are two major drawbacks for a linear regulator: its efficiency and the fact 
that the supply level at its output can only be lower than the lowest possible 
input level. The latter means that a battery that can go as low as 0.9 V in an 
almost discharged configuration will only yield a linear regulated power supply 
of 0.8 V for the complete operation time. The efficiency of the regulator is 
already reduced due to the variable voltage drop over the regulator that needs to 
carry the current that is consumed by the active circuit, but on top of that there 
is the increased current that will be needed for an analog and RF circuit in order 
to give it the same speed and accuracy performance on the low power supply 
level of e.g. 0.8 V [6], compared to a possible much higher voltage that is 
tolerable by the technology. For this reason, linear regulators are in power 
consumption critical wireless systems mostly only used for specific blocks that 
require an improvement of their PSRR. The clearest example is the regulation 
of an LC VCO. As indicated before, an LC VCO does not require a large power 
supply voltage, while it is most critical to picking up power supply noise since 
this directly translates into spurii on the LO signal and thus a reduction of the 
bit-error-rate or the selectivity. 

Part of the previous efficiency argument disappears when the power supply 
level that is tolerable by the technology is close to the low power supply level 
that the regulator generates. For the example of 0.8 V given above, this becomes 
true for e.g. a 0.13 pm and certainly a 90 nm CMOS technology. For very cost 
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sensitive high- volume applications in which a fiill single chip implementation is 
a necessity, like e.g. BlueTooth [8], the use of an on-chip regulator built with 
thicker oxide devices for the complete RF part becomes an option. For the 
digital part of such a chip a switching regulator is more suited since its 
switching noise is in that case no problem, such that the higher efficiency can be 
fully exploited. It remains however doubtful that there is fact a business case for 
many single chip wireless systems in a 90 nm technology. The chip area taken 
up by the digital part may in such a technology become only a fraction of the 
RF and analog chip area and since the latter do not scale well between 
technology generations, there is not a large cost advantage to be expected from 
moving to the more fine line technology, while development cost rise of course 
sharply. 



3.2. Switching regulators 

An alternative to linear regulators are switching regulators. In a switching 
regulator a power transistor will in combination with an inductor, convert an 
input voltage level into a different output voltage. Pulse width modulation 
allows to make the ratio between input and output variable and thus to regulate 
the output level. The main advantage of the switching regulator is its efficiency. 
Values better than 90 % are commonly achieved. The drawbacks of the 
switching regulator are twofold: cost and switching noise. The switching 
regulator will require the use of a good quality inductor, a large decoupling 
capacitor and likely an off-chip power transistor. The switching process 
generates spurii onto the regulated power supply that cannot be removed 
completely. Higher switching speeds reduce the size of inductor and capacitor 
that is needed, but due to a reduction of the efficiency this remains limited to 
values of around 1 MHz, a frequency that is still too low to remove efficiently 
with proper filter choices in the RF part. RF blocks that have a rather large 
power consumption requirement will be the preferred parts to be powered from 
an inductive switching regulator. A power amplifier for long distance wireless 
systems, like GSM, that can transmit up to 2 W, is a good example. 

Lower cost implementations may use a capacitive switching regulator [1]. They 
are limited to lower current levels, have a more worse efficiency and they will 
also generate switching noise, although less than an inductive regulator. 



4. Dynamic power consumption 

Different from many other applications, wireless systems have in general a very 
dynamic power consumption behavior. First of all there is the difference 
between an active communication mode and a stand-by mode. In the stand-by 
mode a channel is only check from time to time to verify if any relevant 
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information is present for the receiver. Even in an active mode digital wireless 
communication will in most case use some form of time division multiplexing 
in which communication is only performed during certain time slots. 

The difference in current consumption between a ‘communicating’ mode and a 
‘non-communicating’ mode can be as high as a factor 1000. In fact, in complex 
wireless systems there may be a wide number of different operating modes, 
each with their own current consumption requirement and timing profile. An 
example is for instance the fact that a synthesizer must be started up first and 
must be locked before a power amplifier can be switched on in order not to 
pollute the transmitted spectrum. Power supply regulation circuits for wireless 
systems must be specifically designed for this task. The efficiency of a basic 
supply regulator varies highly with a variation of its load. The efficiency in both 
a high current consumption and low current consumption mode may however 
be equally important for the battery life time since the low current consumption 
mode may for instance be a power preservation mode that takes up 99 % of the 
total operation time. Advanced power regulators must thus be used that are able 
to detect the current consumption of their load and that can change to a different 
operation mode, like e.g. from switching regulation to linear regulation, in order 
to keep the efficiency high over a wide range of several decades. This has 
however its limitations. The reaction speed of the regulator that can be achieved 
will make that there is each time some transition period that will need to be 
taken into account. More advanced power supply regulation systems will 
therefore start to use a communication protocol with the controller of the 
wireless system such that the regulator can be warned or even pre-wamed for a 
change in operation state with a digital command. 

5. Example implementation 

As example, the power management profile of an ultra low power FM receiver 
will be analyzed here. The discussion start by a presentation of the application 
and its requirements [9]. In the following sections it is further reviewed how 
these requirements impact different choices concerning power supply and 
power consumption issues. 

5.1. Miniature FM receiver for hearing aid applications 

Fig. 4 shows the selected architecture for the miniature FM receiver. This FM 
receiver has been developed as an add-on module for hearing aids. With this 
module clicked on to their hearing aid, hearing impaired persons can now not 
only use the built-in microphone of the hearing aid. They can also use a second, 
wireless, microphone that can be placed far away from the hearing aid, close to 
the audio source. In many difficult circumstances with a high background noise. 
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this can give a significant improvement of the audio signal-to-noise ratio. For 
this application worldwide different frequency bands have been allocated. These 
bands range from 72 MHz to 225 MHz. In these bands smallband FM 
modulation is used with a 25 kHz channel spacing. 




Fig. 4. Architecture of the low voltage FM receiver. 



The application as an add-on module brings some specific requirements. The 
first one is that it must be able to run from the hearing aids battery. This is 
typically a zinc-air battery, which has a working range of 0.9 V to 1.6 V. This 
also means that its full active power consumption must be limited to a few 
milliamps in order not to become dominant on the hearing aids battery lifetime. 
This must be extended with a number of power preservation modes in order to 
further save power whenever possible. 

A main requirement in this application is also that the FM receiver must have a 
programmable PLL on board. It will be discussed later on how this puts extra 
specific requirements on the technology choice and power supply selection 
strategy. If direct operation from the battery cannot be avoided, this brings 
specific stringent PSRR requirements to the synthesizer. 

Finally, the application as an add-on module for hearing-aid applications also 
brings in the requirements that the FM receiver must have a miniature size. In 
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some applications it will even be worn directly into the ear. In that case only an 
earphone is connected directly to the FM receiver and no additional hearing aid 
is used to perform further advanced signal processing on the audio signal. Fig. 5 
shows the FM receiver module and gives an impression of its size. It fits within 
1 cm^. This is realized by having as many components as possible integrated on 
to the chip. 




Fig. 5, The complete miniature FM receiver module. 

5.2. Direct battery operation 

The available options for the implementation of the power supply are in the case 
of this FM receiver highly limited. The most flexible solution would be the use 
of an inductive supply converter. As discussed in section 3 this can have the 
advantage of an optimum supply level choice, most suited for the selected 
technology, while offering also the benefit of giving only a small variation on 
the power supply level. Power consumption is very critical in this application, 
but an inductive converter can also offer an efficiency of more than 90 %, 
although generally realized at higher power levels. There are however two main 
reasons why inductive conversion is not an option here. First of all there is the 
miniature size requirement. It does require at least a low loss inductor and 
capacitor. Their size is directly related with how much the switching ripple must 
be reduced. In this case this is highly important since a ripple on the power 
supply will be picked up by the synthesizer and will result in spurii being 
generated on the local oscillator, thus reducing directly the sensitivity. 
Increasing the switching speed will reduce the size of the components that are 
required, but this comes at the cost of a reduction of the efficiency. The 
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maximum usable switching speed is 1 to 2 MHz and this is too low to make that 
the generated spurii would fall far out of band such that the antenna filter would 
still make the sensitivity OK. 

The alternative solution to generate a highly stable power supply voltage is the 
use for a low drop out linear voltage regulator. Since the power consumption is 
only a few milliamps such regulator could be placed on chip at a limited area 
cost. The problem is that the regulator generates a voltage drop. The output 
voltage of the regulator will thus be smaller than the smallest possible input 
voltage. In this case this would mean for a power supply level that varies from 
0.9 V to 1.6 V that the regulated voltage would be close to 0.8 V. This extra 
drop of 100 mV has a significant impact on the still achievable performance of 
the RF, analog and digital circuitry. The speed performance of e.g. a D-flip-flop 
configured as divide-by-2 block decreases for instance with a factor 2 for a 
power supply level change from 0.9 V to 0.8 V. 

The consequences of not being able to use a switching power supply converter 
or a linear regulator is that the only remaining alternative is to work directly 
from an unregulated power supply. This is a situation that is in general not 
commonly used for wireless systems. It are however the extreme requirements 
of ultra-low power consumption, and miniature size that dictate this only 
alternative remaining option of running the RF circuits directly from the battery 
voltage. The consequences for the circuit designer are significant and two-fold. 
One is the fact that all circuits will now have to cope with a power supply level 
variation that is a factor 1.8 between the lowest and highest value, while the 
circuit properties like e.g. gain or bandwidth should not change much over this 
range. Second is the fact that any ripple which will be induced on to the power 
supply by any block that draws some peak current at a certain frequency will 
directly be picked up by the RF circuits. In this application such ripple can be 
quite significant. Hearing aids with a high audio gain can draw a significant 
peak current from the battery. The battery is not capable of instantaneously 
delivering this peak current and an immediate voltage drop on the battery level 
will be the result, especially since there is not much room to place a large good 
quality decoupling capacitor that can level out these variations. A good PSRR 
will thus be a main requirement for the RF circuits that run here directly from 
the battery. 



5.3. Technology choice 

The fact that this FM receiver needs to operate directly from a single cell battery 
has a strong influence on the technology selection. The first main choice is 
between a bipolar, BiCMOS technology and a pure CMOS technology. A pure 
bipolar technology is actually no option since we also need a digital controller 
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on board and a lot of analog features need to be digitally programmable which 
is best implemented with MOS switches. The trade-off between using a 
BiCMOS or CMOS technology is mainly determined by the Vt values of the 
MOS transistors under consideration. As soon as the Vt of both the nMOS and 
pMOS is significantly under the Vbe of a bipolar transistor, it will be easier to 
realize the operation at 0.9 V in the CMOS technology. In this case a 0.25 pm 
CMOS technology has been selected with a medium Vt transistor option. The 
latter makes that transistor with a Vt of 300 to 400 mV are available. This gives 
the possibility to generate just enough overdrive at 0.9 V to achieve the 
operation at 225 MHz. The drawback is that these device show more leakage, 
but in combination with the standard transistors that have 500 to 600 mV Vt the 
circuits leakage can be optimized, 

A 0.18 pm CMOS technology would have these lower Vt transistors standard 
available. Since it is capable of handling 1.8 V power supplies it is also suited to 
work directly from the zinc-air battery. There is however not a lot of added 
advantage compared to a 0.25 pm CMOS with medium Vt option. Since we 
have to work with smallband FM modulation in this application 1/f noise is a 
critical factor in all circuits blocks and minimum length transistors are seldom 
used, even in such a critical circuit as the RF mixer. The blocks that would 
benefit directly in power consumption are high speed digital circuits, mainly the 
prescaler in the PLL. The drawback of using a 0.18 pm CMOS is of course a 
higher cost, while the chip area would remain the same since more than 70 % of 
the area is occupied by linear capacitors anyway and these do not really scale 
down in succeeding technology generations. 

Going further down in technology feature size is in this case not really an 
option. There is of course the further increasing cost, but additionally there is 
the power supply level of 1.6 V that cannot be handled by a 0.13 um CMOS 
technology. An extra on-chip regulator would be needed using thicker oxide 
transistors to come to the allowed 1.2 V power level. For the 225 MHz 
smallband application discussed here there is no need for this. More broadband 
higher frequency applications, working e.g. at 2.4 GHz, could benefit from the 
faster devices. 

A final note on the technology is on the variations to be expected over the 
processing conditions. The relative wide power supply level variation, almost a 
factor 2, will already result in enormous variations of analog properties if no 
appropriate circuits techniques are used. The low power supply level adds to 
this fact that the used overdrive voltages will be small and thus the relative 
variations become larger for the same variation on the Vt’s over the processing 
conditions. 
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5.4. Low power consumption 

The front-end of this FM receiver is a low-IF topology without mirror signal 
suppression. The main reason for not doing any mirror signal suppression is to 
save power. This comes at the cost of a 3 dB higher noise level. An occupied 
mirror channel is treated the same as when the selected channel is occupied. The 
user will have to switch channel. The IF filter is an 8th order low noise 
bandpass filter with 25 kHz center frequency. 

The back-end contains an FM demodulator followed by an audio filter and both 
a hearing aid and earphone output driver. The FM demodulator uses an analog 
delay line specifically designed for its low noise operation. An automatic 
frequency control loop sets the delay time and thus tracks the deviations on the 
25 kHz IF frequency by checking that no DC level remains present after FM 
demodulation. The core of the audio filter uses a programmable switched-cap 
implementation, preceded by an continuous time passive anti-aliasing filter and 
followed by a continuous time active smoothing filter and highpass filter. The 
hearing aid driver generates a signal that can be further amplified and processed 
by the hearing aid itself, while the earphone output allows to drive 1.5 mA 
directly into an earphone without the need of using a hearing aid. A squelch 
circuit is also present in the back-end. This shuts off the audio output when the 
signal quality becomes too low. 

A dedicated digital controller is on board. After a power on reset this controller 
temporarily switches on a voltage doubler to download the content of an 
external EEPROM that is 3D mounted on this chip as bare die. After this data 
download the voltage doubler is switched-off again to preserve power and 
prevent the addition of a switching ripple on the power supply. A two-way 
inductive link is also present. This is a short range low data rate wireless digital 
interface that is used to allow for remote control of the device by the user. 
Actions that can be performed by the user are for instance switch to the next 
available channel or scan all available channels. The same inductive link can 
also be used in a two-way mode for servicing and optimization of the settings 
by an audiologist. In this mode commands can be given that switch on the 
voltage doubler. Reading from and writing to the EEPROM via the wireless link 
becomes then possible. 

In order to obtain an ultra low power consumption, specific design techniques 
have been used in the ENA, VCO and synthesizer. The schematic of the 
implemented ENA with the antenna and its matching network is given in Fig. 6. 
The antenna is a high Q inductor wounded on a ferrite core. The Q of the 
antenna is about 100 giving a significant increase of the voltage level that is 
applied to the ENA’s input transistor. The resonance frequency is set by laser 
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trimming an external capacitor. The Q-factor of the input circuitry is so high 
that only a frequency band of 1 MHz is selected. In order to extend the 
frequency band of operating an on-chip capacitor bank is used to dynamically 
track the channel frequency by changing each time this bank setting together 
with changing the synthesizer frequency setting. 




For the VCO an LC VCO is used with external inductors to have a sufficiently 
high Q factor. Its schematic is shown in Fig. 7. The use of high Q external 
inductors is the only way to keep the power consumption low. The 0.9 V power 
supply makes that we can only use a small tuning range. Taking into account 
the charge pumps limitations, the useful range is from 300 mV to 650 mV. This 
small range makes that we need to use a relative high Kvco of 50 MHzA^. An 
external laser trimmed capacitor is used to set the band of operation and makes 
that the varicaps can remain small. The varicap that is used is a pMOS operating 
in inversion mode. The relative large Kvco makes this VCO sensitive to power 
supply variations. This is reduced as much as possible by using the cascoded 
current source. The 0 volt DC biasing of the resonance tank makes it also less 
dependent on power supply variation and by having the loop filters ground not 
connected to the on-chip ground and substrate, but via an extra bondpad to the 
external ground of the VCO we further ensure that the PSRR is good. 
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5.5. Alternatives to direct battery operation 

To run directly from the single cell zinc-air battery is not for all blocks in this 
application the most suited solution. The low frequency audio signal processing 
blocks in the back-end could run from a regulator that keeps the voltage 
constant between 0.8 V and 0.9 V. This would give these blocks an improved 
PSRR, but since this is far less critical than the PSRR of the synthesizer, this 
option has not been taken. 

A block that needs a higher power supply level than 0.9 V is the EEPROM that 
stores a number of programmable settings. In this case an external EEPROM is 
used that is 3D mounted onto the FM receiver chip. This EEPROM requires at 
least 1.8 V power supply, its current consumption is limited to 1 mA. This can 
be easiest realized with a capacitive voltage doubler if transistors are available 
that can withstand 3.2 V. 3.2 V can occur since the highest possible battery 
level will also be doubled. A capacitive voltage conversion circuit is, unless 
specifically designed for it, not a regulated system and the output will be an 
integer multiple or a fraction of integers of the input. In this case this is no 
problem for the EEPROM that can handle a wide range of power supply 
variation. Such a capacitive voltage doubler has been implemented here, using 2 
external capacitors. The main problem of this voltage doubler is that it generates 
a switching ripple on the power supply. This is solved here by having the 
EEPROM and the voltage doubler only switched on directly after a power-on 
reset. The dedicated controller initiates at that moment a full download of the 
EEPROM contents into its RAM and directly after that the EEPROM and the 
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voltage doubler is switched off again. Only after the voltage doubler has been 
switched off, audio reception is switches on. 

A second block that uses voltage doublers is the switched-cap programmable 
audio filter. The block itself is not fed from a doubled power supply, but the 
clock signals to the switches are doubled. The main difference is that these 
signals do not draw a lot of power since these are small capacitive loads. The 
consequence is that clock doublers do not really introduce a significant 
switching ripple on the power supply, while they can allow for a significant 
increase of the signal swing that can be used in the switched cap circuits. An 
increased signal swing results directly in a scaling down of area and power for a 
given signal-to-noise requirement [6]. The condition is again that transistors 
must be available that can handle 3.2 V. If not, alternative techniques, like 
switched-opamp can be used to increase to usable signal swing [10]. 

There are a number of switches for which it is not preferable to use a thicker 
oxide transistor such that they are able to handle the double power supply. 
These are RF switches; switches that are used in RF circuits that need to handle 
high frequency signals and for which their Q-factor is of high importance. 
Examples are a switched capacitor bank to trim the center frequency of a 
resonance tank in e.g. an LNA or VCO. In order to achieve the highest possible 
Q-factor, these switches do need to get the highest overdrive voltage possible all 
the time. In the case of this 0.25 pm CMOS technology, the highest possible 
allowed voltage with the standard transistors is 2.5 V. This situation can be 
solved by applying the voltage doubling on a signal that has been regulated first. 
Again, the current requirement for driving these switches is low. In this case the 
regulated voltage can be the bandgap reference signal. A 1.2 V bandgap 
reference signal is highly stable over power supply and temperature and in a 
doubled version it will result in a 2.4 V drive signal for the RF switches that is 
close to the maximum overdrive possible, while it does not vary much with 
power supply variations. 

5.6. PSRR requirements due to direct battery operation 

For the LNA and mixer the interference of power supply signals is only a 
second order problem. It is only in combination with a limited linearity that low 
frequency power supply noise can be mixed up to the high operating 
frequencies of the RF carrier. In most cases the received signals will be rather 
small and no effect will be present. When large signals would be received, the 
relative impact of the interfering power supply signals would be small. 

The synthesizer is far more critical towards the rejection of power supply noise 
that is picked up. Power supply noise can generate jitter on the local oscillator 
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signal, visible as spurii. Depending on their frequency these spurii will either 
introduce a distortion of the received audio signal or a reduction of the 
selectivity of the receiver. These effects will be present for both small and large 
received input signal levels. 

The synthesizer architecture is shown in Fig. 8. The synthesizer is a fractional-N 
synthesizer that uses a relative high comparison frequency of about 8 MHz. 
This allows for the use of a high PLL loop bandwidth of 30 kHz. This high loop 
bandwidth works as an advantage to keep the close-in phase noise low. This is a 
critical issue since we use smallband operation here. Smallband operation needs 
a low close-in phase noise, but the MOS transistors 1/f noise makes it more 
difficult to achieve this. The high loop bandwidth gives extra suppression to this 
1/f noise. The high loop bandwidth makes it also possible to integrate the loop 
filter on chip. This is critical since 5 extra external components would 
compromise the miniaturization possibilities. The LC VCO operates at the 
double frequency. It is followed by two CML logic dividers. A MUX makes it 
possible to receive frequencies in a range from 72 MHz to 225 MHz. The values 
of the external components determine which frequency band is actually 
selected. 




Fig.8. Architecture of the synthesizer. 



How the PSRR of the VCO is kept high was already described in section 5.4. 
Other blocks that are critical for the PSRR of the synthesizer are the logic gates 
in the reference and PLL feedback path. Different from the VCO, the spurii they 
introduce are not suppressed within the PLLs loop bandwidth. Fig. 9 shows the 
delay times of a NAND and a D-flipflop used as divider. The delay time is 
given in function of the power supply level for the 0.25 pm technology that we 
use here. At 1 V and below you can see a dramatic increase in delay time since 
the power supply level is coming close to the sum of the nMOS and pMOS Vt 
threshold values. This does imply that at 1 V and below variations on the power 
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supply will create significant timing jitter on the digital signals. In the 
synthesizer such jitter translates itself into an FM modulation of the carrier 
frequency according the following expression: 

^fdev = 25 • At^ • fLQ • fj^o^ (1) 

Remark that this means that the sensitivity of the divide-by-2 block towards 
power supply noise is independent of the frequency it operates on. A divider in 
the reference path working at 8 MHz has the same impact on PSRR as for 
instance a divider that works in the PLLs feedback path at 60 MHz. Careful 
design, scaling and optimisation of the logic used in the synthesizer is thus 
important. 




Fig. 9. Delay time of digital cells i.fo. the power supply. 

Fig. 10 shows for instance how the PSRR of the divide-by-2 block used in the 
synthesizers reference frequency path can be significantly improved. The output 
of the divider is recombined with the input signal in a NAND gate, such that the 
transition of the output signals falling edge is now determined directly by the 
input signal and delayed only by the NAND. The succeeding block is a phase- 
frequency detector that is only sensitive to falling edges. The function of the 
divider is now only to generate a signal that can be used to suppress half of the 
edges of the input signal. The smaller delay time and delay time variation of the 
NAND results in an improved PSRR. The same goes for the prescaler that is 
used. The bottom part in Fig. 11 is a traditional prescaler topology. The classical 
output of this prescaler is now used in combination with other internal states of 
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the prescaler to generate signals that can be used to suppress a selected number 
of the input signals edges. An actual transition of the output signal will be 
triggered through the critical path that is shown at the top. This limits the delay 
time and the delay time variation to that of only 3 NANDs and an inverter. 
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Fig. 10. Improved PSRR design of a divide-by-2 block. 




Fig.ll. Improved PSRR design of a prescaler. 

5.7. Results of the realization 

Fig. 12 shows a microphotograph of the chip. The RF front-end with the LNA, 
mixer and synthesizer, including the loop filter, is located on the left side. The 
chip is realized in a 0.25 pm CMOS technology and is 22.7 mm^ large. What is 
clearly visible here is that a large part of the chip area, in fact more than 70 %, 
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is taken up by capacitors. Main area consumers are the loop filter, the IF filter 
which needs the capacitor area for low noise and the back-end that contains a 
continuous time 100 Hz AFC loop filter and a 30 Hz audio highpass filter. 




Table 1 gives an overview of the chips main properties and measurement results. 
The chip operates from 0.9 V to 1.6 V. Off-mode current is 1.5 pA. Active- 
mode current including front-end and back-end signal processing is 2.2 mA. The 
carrier frequency can vary from 72 MHz to 225 MHz. The actual band of 
operation is selected with the external component values and synthesizer 
settings. Fig. 13 shows the audio output signal to noise ratio in function of the 
input signal power for an audio bandwidth of 7.5 kHz. At high levels the SNR is 
limited to 43 dB. This is determined by the close-in phase noise of the 
synthesizer. At small input signal levels a 10 dB SNR is achieved at -103 dBm. 
The RF bandwidth is at least 7 MHz. The audio bandwidth is programmable 
between 5.5 kHz and 7.5 kHz. The measured total harmonic distortion is 2 % for 
an input signal with a 5 kHz FM deviation. The remote controls sensitivity is 50 

pp,diff* 






244 



Supply voltage 


0.9 V- 1.6 V 


Off-mode current 


1.5 pA 


Active-mode current 


2.2 mA 


Carrier frequency 


72 MHz - 225 MHz 


SNR 


43 dB 


Sensitivity 


-103 dBm 


RF bandwidth 


7 MHz 


Audio bandwidth 


5.5 kHz - 7.5 kHz 


THD 


2% 


RCo/RP sensitivity 


50 pVpp, diff 


Chip size 


22.7 mm"" 



Table 1. Main properties of the FM receiver. 




Fig. 13. SNR in function of the input signal power. 



6. Conclusions 

Power management is an essential part of the design of any wireless system. In 
this paper the importance of selecting the most suited power supply level has 
been discussed in combination with how this power supply level can be 
generated with a regulator that is suited for RF circuitry. With an example of a 
miniature ultra low power FM receiver it has been demonstrated how running 
directly from the battery without any regulation has an enormous impact on the 
choices to be made for the wireless system. Especially the PSRR of the 
synthesizer is a most critical parameter, even influenced by the digital circuitry 
used in the PEL loop. Circuit design techniques have been introduced that 
improve the PSRR of the synthesizer significantly. 
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Abstract 

Power, energy and battery management cover a common field of 
expertise that is involved with the supply of power to electronic 
circuits. With portable equipment, small dimensions, low weight, 
long operation and lifetime of the on-board energy source are 
crucial. Efficient power conversion and interaction with the 
circuits to be supplied can bring down the power consumption. 
Only through measurement, control and the accompanying 
algorithms for power management, the targeted performance can 
be realized. The term management refers to the operation of all 
functions involved. Knowledge about all contributing parts is at 
the base of proper power handling and management algorithms. 
Electronic circuits must provide the means to make it happen. 
Knowledge about the IC part of electronic circuits is abundant, 
less is known of the ‘bulky’ devices as batteries, transformers and 
large transistors used with power conversion. These devices have 
a significant impact on size, cost and performance. This paper 
describes a number of subjects related to such devices rather than 
the ‘how’ of electronics and focuses on aspects for IC designers 
and small portable equipment. 



1. Introduction 

In the past, low power consumption of electronic circuits and efficient power 
supply of these circuits was hardly considered to be an issue of added value. 
Nowadays, both with mains-supplied as well as with battery-supplied products, 
such an approach has an unacceptable impact on product performance and price. 
Especially with portable electronics as cellular phones and audio players the 
form factor has become that important that only little room is available for the 
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batteries. At the same time, the number of dedicated supply voltages in any 
apparatus has grown significantly. This ranges from the lowest voltage for 
small-feature-size CMOS ICs to higher voltages for interfacing to the 
‘customers world’ that e.g. dictates high voltage for LCD displays or output 
power for a loudspeaker or transmitter. Traditional power supply solutions using 
copper-iron transformers and linear regulators dominated for a long time 
because of their low cost price. The market does not accept the consequences of 
their size, weight or losses anymore and thus more advanced solutions became 
unavoidable. The demand for multi-channel audio power systems drives the 
application of efficient class-D amplifiers since that allows for the design of slim 
cabinets. 

The mains voltage is the primary source of energy to most equipment. The use 
of rechargeable batteries grew dramatically because of the high power drain and 
intensive use of portable electronics. 

The user of portable electronics is sensitive to the ‘look and feel’ of the product 
exterior and thus the electronics inside has to provide performance that allows 
for attractive design. Miniaturization of energy storage and conversion devices 
is hampered by many physical constraints. Furthermore, cost price must be low 
for a competitive solution. Electronic designers are confronted with the total of 
all aspects involved. Extensive knowledge of the involved technologies and 
implementation consequences must be made available to designers since they 
take many decisions that have a significant influence on the final product. 



2. The Energy Chain 

All energy conversion and storage actions imply volume and losses with the 
accompanying consequences on cost, size, weight etc. Thus, an in-depth 
understanding of all energy conversion and storage physics and principles is at 
the basis of advanced power management. For this reason, the whole chain must 
be considered for allowing optimal design for an attractive product. 




Fig. 1 . Energy conversion and storage chain. 
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Mismatch between the available voltage and the required voltage implies 
voltage conversion that preferably must be done in an energy-efficient manner. 
A mains power supply has been used for an extensive period of time, since 
clearly the mains voltage is generally quite different from the required voltage. 
With battery-supplied equipment, a same structural discrepancy exists between 
the battery voltage that is generated by electro-chemical processes and e.g. the 
IC supply voltage that is dictated by the IC-process feature-size and is optimized 
to a single value. 

Energy efficiency of a system is directly related to the conversion and storage 
actions through the several energy domains. In fig. 1, a so-called ‘energy chain’ 
is depicted. This ‘energy chain’ contains the blocks that physically carry the 
currents and voltages of the electronic equipment. It shows the energy domains 
involved in the several parts of a system. In fig.l, subsequently, the electrical 
energy from the mains is converted via the magnetic domain in a transformer of 
a charger, then again into electrical energy that in turn is stored via the chemical 
domain of a battery, then again supplied as electrical energy to a DC/DC 
converter that applies the magnetic domain and provides the appropriate supply 
voltage to the actual system that finally has to provide output in the desired 
energy domain. 

The part designated as load represents the whole application that has to be 
supplied, e.g. a cellular phone or portable CD-player. The energy domains 
shown in this block are the actual energy domains that provide the actual desired 
functionality, i.e. light e.g. for data display and video, motion for audio and EM 
radiation for communication over a long distance. 

Underlying, but not depicted in detail, with this ‘energy chain’ is a significant 
amount of power management that includes measurement in and control of the 
‘energy chain’ circuits. 

battery DC/DC PA 

0.1W 0.5W 4W 
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As an illustration of practice, fig.2 shows the power flow of the transmit part of 
a GSM phone. An overall efficiency of only 30% is achieved from energy stored 
in the battery to antenna output. The high loss assigned to the power amplifier 
(PA+) in fig.2 includes the PA, filters and antenna switch. It must be understood 
that with GSM a saturated PA is applied that has a best-case efficiency of about 
60%. With future communication systems, linear PA’s are required that have a 
much lower efficiency. Fig.2 clearly shows that the poor efficiency of the 
transmitter stage necessitates much higher power levels to be converted and 
delivered by the battery. Improvements in efficiency at the end of the chain are 
rewarded by lower cost and smaller size of preceding parts in the chain. 

Following the energy chain from left to right, in the past years a number of new 
means have been provided for the design of power management systems. 

• Introduction of an IC process for mains voltages, [ref. 1 , 2] 



• Physics-based modeling of transformers [ref.3] 

• Digitally-controlled DC/DC converters [ref 4] 

• Physics-based modeling of batteries [ref 5] 

• A/D converter with low offset voltage [ref 6] 



3. Power Management 

Exploiting the available energy to a maximum is a useful approach especially 
when the energy must come from a portable source. Improving the efficiency 
from 90% to 95% may not be very fruitful from an energy standpoint of view. 
However, this implies half the losses that need to be accommodated by the 
several devices. Lower losses allow for smaller size or cheaper packaging. 

This latter argument, partly responsible for cost and size, is driving the effort 
towards very high efficiency. 

Apart from providing an optimal power supply in an efficient manner, power 
management also involves management of power consumption. Supply voltage 
and bias optimization or enabling circuits only to operate in certain time slots 
are examples of power awareness in modem systems. Power supply provision 
and consumption should not be considered as a separate issue, but mutual 
interaction with the actual system is involved as well. 

Power management can reduce losses by providing the optimal bias current or 
supply voltage, and by switching circuits off when they are idle. Active clock- 
management can provides an optimal local clock frequency to reduces losses. 

Traditionally, the system blocks of the signal path and power supply are 
independent. An optimal system would also use the actual desired output e.g. the 
desired light, EM radiation or speed as a feedback to the system blocks. An 
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example of this is depicted in fig.3, where the voltage VDD for the digital 
circuits is dictated by the application itself in order to run at the lowest adequate 
supply voltage. 




clock supply 



VDD 



Fig.3. Adaptive power supply for a digital circuit. 

The power consumption of digital CMOS is P = f*C*VDD^ and obviously 
strongly dependent on VDD. A common supply source has 10% voltage 
tolerance. An IC must be designed for worst-case conditions such that it can 
operate within a certain process window and temperature range and at lowest 
supply voltage. For reliable operation the margins of VDD, process and 
operating conditions must be added, thus leading to tens of percents higher 
supply voltage than possible with an adaptive supply. In the example of fig.3, 
the speed of the critical delay path circuit, representative for the critical delay of 
the digital circuit block that implements the desired function of the IC, is 
compared with a clock signal. As a result of the comparison, the supply voltage 
VDD is controlled to such a value that the desired speed of the critical path just 
meets the required speed given by the clock frequency, and thus, the digital 
circuit block that operates on the same VDD is guaranteed to provide its function 
with the proper performance. In this way, the minimum supply voltage for the 
whole process window and application conditions is generated at minimum 
power consumption. 



4. Energy Storage and Conversion 

Storing energy is directly linked to volume that in turn is directly linked to price. 
An amount of energy is expressed in joule or watt*second. The highest energy 
density is provided by chemistry, both by volume and weight. Well-known 
examples are combustion of fuel used in cars or for heating and electrical energy 
delivered by batteries. Demand for electrical vehicles or hybrids is a major 
driving force in the development of new energy storage and conversion systems. 
Small portable electronics take advantage from those developments. Passive 
electrical components as capacitors or inductors have a dramatically low energy 
storage density compared to batteries. The energy storage density of 
rechargeable penlight or AA-size batteries has improved about a factor of 4 from 
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the introduction of first NiCd to the current NiMH or Li-ion batteries. A 
capacitor with a comparable discharge voltage range as a modem AA-size 
NiMH battery should have the impressive value of about 20 kF. When empty, 
this battery shows about the same capacitance as a so-called super-capacitor of 
the same size, i.e. in the order of 10 F. 

A power supply unit is traditionally used to accommodate the mismatch between 
mains voltage and desired supply voltage. Mismatch also exists between the 
voltage provided by the electro-chemistry inside batteries and the desired 
optimal voltage. The discharge voltage range from a full to an empty battery is 
about 40%. CMOS prescribes a specific and accurate supply voltage and some 
intermediate conversion is required that converts the voltage in an energy- 
efficient manner. In the past a too-high voltage was provided and the difference 
was dissipated. Now, for a same reason as the converters for the mains voltage, 
efficient DC/DC converters between battery and electronic circuits have become 
common in advanced systems. 

Solutions for power conversion use switched-inductor or switched-capacitor 
techniques. In these converters, energy from the input source is temporarily 
stored in an inductor or capacitor and subsequently transferred to an output 
buffer that supplies the load. A continuous consumption of power implies the 
transfer of amounts of energy at a certain rate or switching frequency. Since 
only small amounts of energy can be transferred via capacitors or inductors, this 
has to be done at a sufficiently high frequency. Transistors, either discrete or 
integrated on an IC, implement the involved switches. Capacitors on ICs have 
>10% parasitic capacitance to ground while 'floating’ capacitors are needed. 
Furthermore, semiconductor technology is basically a surface technology and 
thus cannot provide significant on-chip energy storage since the latter requires 
volume. Increasing the switching frequency is generally linked to higher losses. 
Therefore, fully integrated power conversion on silicon can only provide limited 
power. In practice, monolithically integrated charge pumps exist up to the milli- 
watt level. For higher power, discrete energy storage devices remain required. 
Above constraints dictate that devices for energy storage and conversion remain 
rather noticeable. 



5. Design Aspects of Power Conversion Electronics 

Already for long time, the design of electronic circuits in ICs is performed 
through software. Both general-purpose circuit simulation packages as well as 
dedicated software are available for a wide range of performance levels. 
Especially with IC design, bread boarding has completely become obsolete and 
processed silicon is in good agreement with simulations. The design of power 
conversion circuits has lagged in this somewhat. This is partly because of the 
wide dynamic time range of the circuit techniques involved that provide the 
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required high conversion efficiency. Switching transitions are in the nanosecond 
range, switching frequency is up to several MHz, loop bandwidth in the tens of 
kHz range and the supply voltage ripple is e.g. in the lOOHz range or lower 
during start-up. This has brought about a split-up between circuit simulators that 
can do ‘real-time’ time-domain analysis of separate blocks and system-level 
software that aims at analysis of the function. With the trend to higher switching 
fi’equency, it becomes required to merge both and to incorporate accurate real- 
time circuits in the system-level analysis. For the actual design of the converter 
itself it becomes increasingly required to model all the parasitic elements of the 
‘power train’, i.e. the circuit parts where the power is handled and the switched 
current flows. An example for the latter is the VRM (voltage regulator module) 
for high-end microprocessors. A VRM has to supply e.g. 1.2V@100A within 50 
mV window under full dynamic-load conditions. The very high dl/dt involved 
(lOOA in a fraction of a psec, controlled e.g. by a mouse click starting a 
program) necessitates modeling of every device and mm of PCB wiring down to 
Abactions of nHs and mQs with great accuracy. The term ‘switching noise’ used 
with switched-mode conversion may easily offset small-signal circuit designers 
since this switching noise may be larger than the supply voltage itself and 
appears across virtually no distance of wiring. 

Efficient power conversion electronics even at lower power levels becomes 
rapidly unavoidable in a growing range of applications. This brings an 
increasing part of the design of such systems from power-conversion experts to 
IC designers. They need better design tools and models that make repeated 
prototyping superfluous. Accurate compact models suited for time-domain 
simulations are required, in which knowledge of other disciplines is ‘frozen’. 



5.1 Losses in Electronics 

As a first rule-of-thumb, for high power it is recommended to choose a higher 
supply voltage. In this way, current levels and thus conduction losses are lowest. 
At the same time, voltage-related switching losses will increase. Requirements 
of miniaturization of inductors and capacitors ask for higher switching 
frequencies with accordingly increased switching losses. 

Components involved with power handling such as switches, transformers, 
inductors and capacitors, have properties that introduce losses. Such losses 
include conduction losses due to resistance in the current path and switching 
losses due to state transitions in converters. A constant flow of power instead of 
power bursts provides lower losses. Buffer capacitors handle power bursts and 
equalize the power flow in other supply paths. For this, it is required that those 
buffers have a lower equivalent series resistance (ESR) themselves than the 
current path that they bypass. This requirement may determine the capacitance 
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and size. For comparison of several types of capacitors, one may use a FOM 
(figure-of-merit) C*ESR. 



5.2 Switching of a MOS Transistor 

The workhorse in power converters is the MOS transistor. In fig.4 a low-side 
MOST is depicted with a gate drive and load current. Turning on of this MOST 
can be described as a sequence of time intervals. Fig. 5 shows the several time 
intervals involved during switching on of the transistor, starting from VGS=0 
and VDS=VDsup. The root cause of dissipation in every interval provides 
information on how it can be influenced. Timing diagrams can be made for 
turning on or off, both for high-side and low-side transistors. At t2, ID = Hoad. 
The gate plateau region (t3 ->t4) i.e. VGS is constant results from the feedback 
path via CGD when all gate-drive current flows into CGD. During this plateau 
region (t4-t3 ~ VDsup»CGD/Ig), both high VDS and ID exist at the same time. 
Normally, a FOM = QGD»RDSon is used for the selection of a transistor that 
refers to this region were the dissipation is highest. Fig.5 still is a simplified 
version of practice. Especially with high switching frequencies, the usual FOM 
becomes inadequate. Aspects involved for better selection or design of a 
transistor include body diode recovery, non-linear capacitances, internal gate 
resistance and lumping into more transistors. The gate drive current is supplied 
from a source with associated losses in the gate drive circuit. 




Fig.4. Test circuit for tuming-on of a transistor. 
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Fig.5. VGS, Id and Vds during tuming-on of the transistor in fig.4. 



5.3 Switched-Mode Power Converters 

Switched-mode converters can convert voltages and currents to another value 
with high power efficiency. These are based on switched-inductor and switched- 
capacitor techniques. Switched-inductor converters are known in a wide variety 
of topologies that e.g. convert an input voltage to a higher voltage (up or boost), 
to a lower voltage (down, buck, class-D) or have a ripple-free output (Cuk). 
Such converters maintain their efficiency under a variable input/output voltage 
ratio, because an inductor can store or release energy with low losses 
independent of the voltage across it. This, together with a control loop that 
defines the output voltage, makes these converters an attractive solution that is 
widely applied. However, the inductor remains a relatively bulky and expensive 
device. 

Many textbooks explain topologies, operation and losses of switched-inductor 
converters. Such converters are strongly non-linear systems with accompanying 
stability issues in the large-signal domain. Small-signal approaches are used as 
well for the design of loop stability. The structural impact of size and price 
many times forces the design of dedicated and optimized converters. Therefore, 
the design of power converters remains a field of expertise by itself. 

Within certain constraints, switched-capacitor converters can have high 
conversion efficiency as well. 
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As an example, the properties of a capacitive voltage-doubler or charge-pump 
are described. The topology and equivalent circuit diagram are depicted in fig.6. 




Fig.6. Voltage-doubler with equivalent circuit diagram. 

During a time interval D*Tp a pump capacitor Cp is charged via switches by an 
input voltage and sub-sequentially during (l-D)Tp stacked via other switches on 
top of the input voltage and connected to an output buffer capacitor that 
maintains the pumped-up voltage (with Tp being the period time of Fswitch). 
When the total resistance in charge or discharge path for Cp both equals R, then 
the output impedance of the charge pump is a resistance Rout of which the value 
is given in fig.7. The graph shows two asymptotic lines for the value of Rout that 
can be derived by assuming that Cp « R*Fswitch or Cp » R*Fswitch and the 
current through the switches either returns to zero or remains constant in its time 
interval. The actual value of Rout at the intercept point of both lines is about 
30% higher then the intercept value. Assuming only conductive losses in the 
switches, then the efficiency follows Ohm’s law, i.e. q = Vout/2Vin = 
Rload/(Rout+Rload). Varying Fswitch or D changes the value of Rout and enables 
some output voltage control. 

Storing energy in a capacitor from a voltage source via a resistive path is only 
efficient (WstoredAVloss is high) when the voltage across the capacitor 
approaches the charging voltage, i.e. resembling a relatively unloaded condition. 
This implies a small amount of energy that is transferred every cycle and a large 
fixed amount of energy stored in the pump capacitor, i.e. effectively a poor 
energy density. Furthermore, they are only efficient for a certain fixed voltage 
conversion ratio that is determined by the topology. Constraints for price lead to 
very low-power applications when fully integrated or otherwise to a number of 
discrete capacitors with according number of pins to switches on an IC. 
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Fig.7. Output resistance of the voltage-doubler from fig.6. 



5.4 Measurement 

For accurate control it is required to have accurate information. In systems 
where power management is involved, a wide diversity of variables must be 
observed. Sensors for temperature, pressure, light etc. with appropriate sensor 
electronics provide the information for the control system. In electronic circuits 
such information is mostly an electrical signal as voltage or current. The 
dominant device for accurate current sensing is a resistor that converts current 
into a voltage. Accurate voltage measurement of low voltages is the basic 
function for sensor electronics. An analog-to-digital converter has been designed 
that combines a high bandwidth and a very low DC offset [ref. 6]. This ADC is 
suited for accurate measurement of current, voltage, temperature, or small 
voltages coming from other sensors. When lower bandwidth is required, lower 
DC offset can be realized. Offset can also be subtracted in the digital domain. 
However, large initial offset also implies larger drift. 

A reference source is another crucial element in measurement and control 
systems. Voltage reference circuits (band-gap reference) can be designed in 
modem CMOS processes that have an accuracy of about 0.5%. Higher accuracy 
is feasible by on-chip trimming provisions e.g. using non-volatile memory. An 
example of the application of an ADC for data acquisition for battery 
management systems is depicted in fig. 8. A switch matrix can connect the ADC 
to measure the battery voltage via R1 and R2, temperature via R1+R2 and 
Rl+T, current via Rs, chip temperature via PTAT, together with a known 
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voltage REF that is used for interpretation of the measured voltages. Shorting 
the input of the ADC allows for processing of the offset in the digital domain. 




5.5 Partitioning 

Handling large current and dissipation directly influences many aspects of the 
physical design. The wide range of power conversion systems with associated 
discrete devices and the dimensions of Ls and Cs, heat sink, mains isolation and 
electronics have a direct impact on the FOM (W/m^) and cost price. This has 
lead to the development of components with dedicated technology that have a 
good price/performance ratio. Significant efforts are made to miniaturize or 
integrate discrete passive components or electronics. However, many times 
partitioning of the system over separate hardware parts is more attractive [ref 2]. 
With power electronics, complete integration of the electronics can be attractive 
for moderate or low power and for systems with limited complexity [ref.l]. High 
power or current requires distributed heat sinking using discrete devices whereas 
high complexity requires high-density IC technology such as sub-micron 
CMOS. 

When using high- density CMOS, there is a preference to apply digital circuits 
only. Ref 4 describes a digitally controlled DC/DC converter with a minimum of 
analog circuits that makes it attractive for embedding it on digital ICs. 

A cellular phone applies several voltage domains: high density CMOS running 
e.g. at 1.2V, the transmitter running at -3V and the charger connection must 
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accommodate up to ±20V for a 3.6V battery. An active-matrix LCD module 
generates its own ±20V for driving the gates of the thin-film transistors. 
Power-electronics circuits typically are connected to devices or wiring that can 
discharge energy in the circuits. Such operation conditions force deviating 
requirements, especially for IC processes. Especially problems due to the 
activation of bipolar parasitic transistors can be prohibitive to apply many IC 
processes. The latter and the increasing amount and wide range of dedicated 
voltages generates the demand for specific power management IC processes that 
safely can handle all voltages and currents involved. 



out 




Fig.9. Bipolar parasitic transistors at the output of a CMOS stage. 

As an example, fig.9 shows parasitic transistors at the output of an output stage 
in an N-well CMOS process. In case that both MOSTs are off and current flows 
into the output, the output voltage will be lifted a junction above VDD and all 
current is injected with the P-drain/N-well acting as emitter/base of a parasitic 
PNP. This yields high dissipation; possible offset of other circuit parts and a 
local voltage drop internal in the bulk that may activate other parasitic 
transistors. When the current is flowing out of the output, the output voltage is a 
junction below the bulk and the N-drain/P-bulk of the NMOST acts as an 
emitter/base of a parasitic NPN. Again, high dissipation, offset of other circuit 
parts and activation of other parasitic bipolar transistors can occur. Said 
activation of parasitic bipolar transistors can lead to latch-up. The picture shows 
only one collector for each transistor. Actually, many nearby regions may act as 
a collector and other bipolar transistors could already be indicated in fig.9. IC 
processes suitable for power-conversion output-stages involve vertical stacks of 
diffusions or oxide-isolation that isolate the devices. 



6. Physics-based Modeling 

The concept of ‘power conversion design’ is used for two categories. One is the 
application of existing control ICs with selection of external components for 
optimization of the performance. The other is the original design e.g. of the 
control ICs and algorithms themselves or a fully integrated system on chip 
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including the power train and the external components. IC design tools and so- 
called compact models are available for CAD for IC-design. A compact 
transistor model is an electronic network and set of equations that replaces a 
transistor in circuit simulations. Advanced transistor models are based on 
equations of the physical processes involved, whereas the parameters are based 
on data derived from physical constants and the IC processing. As a result, 
models are obtained that are almost unconditionally reliable, independent on 
their field of application and without being verified in that application. The 
parameter values are correlated with actual process parameters and statistical 
analysis over process spread and full temperature range is common practice. 
Such a design approach is not feasible with bread boarding. 

Transistor models and parameters are made available in parallel with IC process 
development such that circuit designs are ready when a new process is ready for 
manufacturing. This approach of physics-based modeling allows for a highly 
reduced design time. The current level of complexity and accuracy of physics- 
based transistor models for IC design are the result of tremendous accumulated 
effort. Compact models connect the worlds of semiconductor expertise and 
circuit specialists. Without accurate models of all parts, trial-and-error hardware 
evaluation may be required very likely leading to redesign. 

Power conversion brings about a number of additional application conditions. 
Models of discrete transistors used with power conversion are generally more 
simple than those used with IC designs. Especially when high switching 
frequencies are involved, these models become unreliable. Bi-directional current 
flow in the output can generate signals exceeding the supply voltage. This 
generally forward biases a parasitic PN junction, e.g. the body diode of a 
discrete power MOST. With junction-isolated ICs, these PN junctions are base- 
emitter junctions of parasitic bipolar transistors. Modeling of the parasitic diodes 
is generally rather basic; the extension of those PN junctions to parasitic 
transistors is completely lacking in models for IC circuit simulations. 

In the field of power converter design, time-consuming hardware evaluation or 
redesign of electronics was common practice since e.g. no accurate models of 
transformers are available. In practice, many times one uses behavioral models. 
However, they only are adequate in a limited range that fit to a specific situation 
i.e. within the restrictions of a certain operation condition that was already 
predicted and measured. Behavior models of not-yet existing components can 
only be rather general. As a consequence, less optimal designs could be brought 
into production. 

Models comparable to those of transistors are also required for devices such as 
transformers, batteries, fuel cells etc. Then, the expertise and accumulated 
knowledge becomes better accessible to designers. The common platform is 
mathematics that can be dealt with by computers. Thus, accurate prediction 
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instead of confirmation of the behavior of a system can be obtained before a 
system exists. In addition, a better understanding and verification can be 
obtained since virtual measurements, even of the internals of a device, and 
variation of parameters is rather easy and reproducible with simulations. 

An advantage of physical modeling is given by the fact that aspects that later in 
time are studied and understood can be added to the existing model and provides 
knowledge to optimize the devices. 



6.1 Transformer Modeling 

With the provision of reliable scalable transistor models, the number of design 
cycles of an IC is drastically reduced. With power conversion however, there is 
a critical interaction in the design phase with the behavior of transformers and 
inductors. Compact transformer models that show good correspondence with 
transformers in practice over a wide application range are rather complex. In 
practice, simplified models are made with a specific parameter set that is derived 
from extensive measurements on an existing transformer. The permeability of 
the core material p is normally expressed as function of frequency. This is not 
suited for time-domain analysis. Generally, non-physical simple models are 
created and parameterized for a single frequency with non-physical fit- 
parameters, i.e. some form of reverse engineering. 

The creation of a model with parameter set of an existing transformer is already 
rather complicated. An optimal converter requires an optimal transformer for its 
purpose together with the design of the electronics. Generating a sufficiently 
accurate model and parameter set of a non-existing transformer, suited for time- 
domain simulations, was until recently not feasible for power conversion 
engineers or IC designers. Ref.3 describes a method that allows for the creation 
of a model departing from the physical construction (geometrical data, winding 
data etc.) of a transformer with the derivation of the parameters based on these 
construction data and material parameter values from physical data handbooks. 
Transformers are complex devices that show many unexpected phenomena to 
non-experts. Therefore, some expertise is still required to propose a transformer 
with a desired performance. However, such expertise can gradually be acquired 
following the systematic approach from ref 3, since it helps to build up better 
understanding of the internals of a transformer. The behavior of the proposed 
component could be verified and applied in the further converter circuit design 
much earlier than otherwise. Small modifications for optimization may be done 
departing from the same model and by adaptation only of the parameter values. 
For large modifications it may be required to generate a new model first. Fig. 10 
shows the similarity of measured and simulated drain voltage and current of the 
fly-back converter of ref 1 operating in discontinuous mode. 
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Fig. 10. Measured and simulated drain voltage and current. 



6.2 Battery Modeling 

Batteries are the dominant source of energy in portable electronics. A battery is 
defined as a series of one or more (electro-chemical) cells in series. One can 
distinguish primary or non-rechargeable batteries and secondary or rechargeable 
batteries. Rechargeable batteries have become crucial in power management 
electronics for portable systems. In-depth understanding of their behavior is not 
a strength of power engineers or IC designers. With the increasing relevance of 
portable power with the associated complaints about the performance of 
batteries, it is required to bring the field of expertise of electro-chemistry to 
electronic designers as well. This is a pre-requisite for optimal exploitation of 
batteries, both for a single charge/recharge cycle as well as for cycle-lifetime. 
Knowledge about the state-of-charge, to be applied for charge control or for 
display to a user, is at the base of ‘Battery Management’. Electronic designers 
can perfectly well apply ICs or design ICs without knowing what is inside the 
ICs respectively how transistors work in detail. The worlds of electrical 
engineers and semiconductor experts are connected via compact transistor 
models. Comparably, the world of electronic engineers and battery experts can 
be connected via compact battery models that are based on the scientific 
knowledge of electro-chemists. This approach is rather new and is not yet up to 
the level of semiconductor solid-state physics with its 50 years of accumulated 
effort. Chemical processes can be described by mathematics. However, some 
processes are non-reversible or not yet well known or described and some 
physical constants are not known to great detail. Also, the morphology of the 
structure and mechanical and thermal aspects involved make it a big challenge 
compared to semiconductor physics. The change of performance over life 
(aging) of batteries is not fully known since that depends on the batteries’ 
history that can be quite different for each battery. 




263 



A significant and useful result of the effort is described in ref. 5. Even with 
incomplete models, the impact of different charge/discharge profiles and inside 
knowledge of state-of-charge or remaining battery capacity can already be 
studied. The correlation between actual behavior being the result of the several 
chemical processes, and a driving parameter such as current or temperature is 
only meaningful with physics-based modeling. Fig. 11 shows P,V,T simulation 
results of a NiMH battery that very well fit with measurements. The physics- 
based battery model actually provides a virtually transparent battery in which all 
processes such as energy storage can be observed separately in detail, applying 
dynamic current in ‘real-time’ simulations with arbitrarily short time steps. This 
allows for optimization of the charging and state-of-charge algorithms. It can 
also be used for fine-tune the chemistry of a battery. 




Fig.l 1, simulation results of battery voltage V, battery temperature T and gas 
pressure P consisting of PH 2 and PO 2 , during charging and overcharging with 
constant current (0.55 A) and resting. 



7. Energy Sources for Portable Electronics 
7.1 Batteries 

Many users perceive batteries as a nuisance. Advise on how to handle batteries 
to enhance their performance is not a solution. Battery management aims at 
providing optimal comfort and performance for the user of batteries. This has to 
be done by electronic circuits and involves optimal exploitation of the stored 
energy per charge, proper charging algorithms, indication of the remaining 
charge or operation time and a long lifetime. Knowledge of the state-of-charge. 
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actual capacity, impedance, response to charging etc. form an input for a 
“Battery Management System” that takes some concerns out of hands of a user. 

Rechargeable batteries for small portable electronics are dominantly NiCd, 
NiMH and Li-ion types. 

Fast charging of batteries is a demand in which a balance has to be chosen 
between charging speed and cycle life of the battery. Overcharging is 
detrimental to all batteries. Sufficiently accurate knowledge about the state-of- 
charge is required for proper charge management. The faster a battery must be 
charged, the more accurate this information needs to be. With NiCd and NiMH 
batteries, a ‘full’ battery cannot store more energy and additionally supplied 
energy is converted into heat. Therefore, the charging process must be 
terminated. The negative temperature dependency of the cell voltage of NiCd 
provides information for terminating a high charging current. NiMH does not 
always have such a negative temperature dependency. Then, the cell voltage is 
not a proper criterion for fast-charge termination anymore and a temperature 
sensor needs to be placed against the battery. Li-ion batteries are very sensitive 
to their charging voltage. Dependent on the chemistry, deterioration occurs at 
voltages above 4.1V or 4.2V. Additional charge above that voltage is stored, but 
the associated higher voltage implies a much shorter cycle life. Fig. 12 shows the 
relation between cell voltage, stored charge and cycle life. A too low cell voltage 
is at the expense of battery capacity, whereas a too high voltage is at the expense 
of cycle life. It is clear that accurate charge voltage control is required (e.g. 
±25mV) when a certain capacity and cycle life must be provided. Li-ion 
batteries may explode when they are overcharged. In general, Li-ion batteries 
are only provided as a pack that already includes several safety measures, one of 
which is a ‘safety-IC’ that disconnects the battery before it enters detrimental 
over-or undercharge regions [ref 5]. 




Fig. 12. Capacity and cycle life of a Li-ion battery versus charging voltage. 
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A battery can be represented by an EMF, that is a function of the remaining 
energy, together with internal impedance. It is common practice to define a 
battery as being empty when the terminal voltage becomes lower than a certain 
value. However, the EMF is higher than the terminal voltage and thus still some 
energy is left in the battery. This can be recognized when the load is switched 
off and the battery voltage slowly relaxes to its EMF. Said practice is based at 
the assumption that below that certain value the supply voltage is not sufficient 
anymore. As a consequence of the internal impedance, battery graphs show a 
smaller capacity at a higher current. A so-called empty battery still may contain 
some energy that can be exploited by a DC/DC converter when it accommodates 
a lower input voltage than the 'empty-battery’ voltage. 



7.2 Photo- Voltaic Cells 

Photovoltaic or solar cells can provide electrical energy without being recharged 
before. The maximum influx of energy on earth from the sun is about IkW/m^ 
and the conversion efficiency of solar cells is about 10%. This implies about 
0.5W for an area of 5x10 cm^. In less optimal conditions, especially inside a 
building, where only part of the spectrum of the light passes the window glass, 
the generated power may be a few orders of magnitude lower. 

Solar cells generally are PN junctions and generate a current in parallel with this 
junction proportional to the incident light. At no-load, the output voltage is 
determined by the forward voltage of the internal diode at the given current with 
according temperature dependency given by the well-known diode equation. 
When short-circuited, the output current is the full photo current. The output 
characteristic is depicted in fig. 13. In this figure, also the maximum-power-point 
(MPP) is given. The MPP exists where a constant power curve (hyperbola: 
V*I=Pmax) touches the graph of the solar cell. With the common silicon solar 
cells, the MPP is as about at 0.5V. Since this is rather low for the supply of 
electronic circuits, generally more cells are connected in series. For high power, 
many cells are arranged in series up to hundreds of Volts at the output. 

A switched-mode converter that loads the solar cell can be arranged with a 
control loop that causes the solar cell to operate in the MPP. When maximum 
energy must be obtained from solar cells, one must apply tracking to the sun, 
operation in the MPP and energy storage means (capacitor, battery, thermal 
store, electrolysis of water as fuel for a fuel cell) to accommodate temporal 
difference in demand and supply of energy. 
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Fig. 13. Equivalent circuit of solar cell and output characteristic. 



7.3 Fuel cells 

A fuel cell (FC) is a device that converts the energy of a fuel directly to 
electricity and heat, without combustion. Since chemical energy is known to 
have a high energy density and electrical energy without noise is an attractive 
form of energy, fuel cells attract great interest from many parties. The 
tremendous growth of portable electronics as laptops and cellular phones and the 
disadvantages of rechargeable batteries are a major reason for this. However, 
alternative energy systems like FCs have their disadvantages as well. Fuel cells 
are already in use in high-power professional applications. Downscaling of fuel 
cells to lower power has so far not yielded an attractive alternative to 
rechargeable batteries. Several issues that complicate the use of fuel cells in 
cellular phones are discussed below. 

One could consider fuel cells as small power plants with many operational 
aspects in common. An infrastructure around the FC is required to provide the 
proper conditions for the chemical process in the reactor itself. The involved 
reaction temperature ranges from about 50°C to lOOO^C. Isolation of the hot 
cells to the outside and the fuel storage container both take volume. Flow and 
pressure of the fuel may involve sensors and electro-mechanical means as 
pumps or valves and some gas is developed that should be recycled or 
exhausted, possibly by a fan. The exhaust gas of some fuel cells may be water 
vapor, but even this is not a trivial matter for cellular phones. The operation of a 
fuel cell with its supply of fuel, exhaust of reaction products and temperature 
requirements imply a start-up time and a relatively slow time response. Thus, in 
practice, a fuel cell needs to be accomplished by another energy store, e.g. a 
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battery with low impedance. The ESR requirement implies a significant size of 
that energy store. Starting up of a fuel cell system requires e.g. a battery as well. 



For portable electronics, it is suggested that DMFC (Direct Methanol Fuel Cell) 
followed by hydrogen technology will be the most likely types. Since methanol 
is a more convenient fuel than hydrogen, hydrogen fuel cells can be extended 
with a reformer that derives the hydrogen from methanol. Fig. 14 depicts the 
principal operation of a DMFC. Supply and exhaust of chemicals need to be 
managed by sensors, actuators and power management electronics. 
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Fig. 14. Direct methanol fuel cell: CH3OH + H2O + V2O2 CO2 + 3H2O. 

Fig. 15 shows the typical output characteristic of a single fuel cell. An electro- 
motive force (EMF) and complex output impedance can characterize a fuel cell 
behavior. Per cell, the open-circuit voltage (OCV) is about IV whereas the 
useful closed-circuit voltage (CCV) goes down to about 0.5V. The 
accompanying voltage drop implies losses. The low operation voltage is rather 
inconvenient for low-cost electronics but may be acceptable for a IW level. For 
higher power, more cells need to be connected in series. 

A fuel cell is vulnerable for poisoning by contamination. For example carbon- 
oxide is a poison to fuel cells operating at relatively low temperatures such as 
the proton exchange membrane FC (PEMFC). Sulfur compounds, present in 
natural fuels, or air for the oxygen are a major source of poison for FCs. 
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Severe safety issues already exist with Li-ion batteries. Safety and legal liability 
are issues for the introduction of PCs in cellular phones as well. 
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Fig. 15. Typical output characteristic of a single fuel cell. 



7.4 Energy density of energy sources 

Overviews of energy systems many times refer to intrinsic energy density, i.e. 
only the principal technology. In practice, with small dimensions, the overhead 
of packaging or infrastructure is far from negligible. An advantage of Li-ion 
chemistry is that it provides a higher cell voltage than NiMH and NiCd and thus 
allows for fewer cells. This reduces the encapsulation of the cells and housing 
volume that is quite noticeable with small batteries already. The infrastructure 
surrounding a fuel cell is significant as well. Scaling down the infrastructure as 
exists for high-power fuel cells to one for low power raises many challenges. 



7.5 Losses of energy sources 

Energy sources like batteries and fuel cells have internal impedance, i.e. the 
output voltage drops as the load current increases. Part of this output impedance 
is just because of passive resistance of materials present in the current path. 
Another part is because of physical principles that it takes energy to have a 
current flowing, i.e. a voltage drop to exist. 
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The voltage that drives the electro-chemical processes that generates the current 
flow is called the over-voltage. In these processes, several time constants are 
involved. The time constants show as complex electrical impedance. Normally, 
CCV (closed-circuit voltage) discharge curves of batteries are provided for a 
range of DC currents at a certain temperature. These do not provide sufficient 
information for dynamic current, i.e. having a non-constant profile. An 
equivalent model of a battery that consists of an EMF source and complex 
output impedance provides the proper battery response on any load or charge 
profile. Said EMF and output impedance depend on involved battery type, 
temperature, state-of-charge and lifetime. Internal losses in the battery exist 
according I«(OCV-CCV). Comparable issues are valid for FCs. 



8. Conclusions 

Power Management in portable systems is a necessary infra-structural provision 
for long operation time or small size belonging to the few distinguishing 
features. Improvements will become default requirements since users are getting 
used to a certain performance. The demand for additional (power-hungry) 
features necessitates more complex power management electronics in the future. 
Power management involves interaction across the whole system level and 
includes an increasing amount of electronic and non-electronic fields of 
expertise. An increasing part of the design trajectory of a system shifts to IC 
designers who are confronted with the design of the whole system. Making the 
many fields of expertise available in a proper format to IC designers is key to 
the competitiveness of their designs. Accurate and predictive physics-based 
models of all components involved, together with a toolset that covers the full 
design flow are required for a short design time and time-to-market. Due to the 
direct link between energy, power, size and cost, optimal design of power 
systems and associated management remains a challenge since it covers a wide 
range of disciplines. 
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PART III: Low-power and high-resolution ADC’s 



For ADC’s low-power and high-resolution are important specifications. In 
fact in most cases the ratio of the two is important. At the design of the 
program several aspects have led to the specific choice of speakers. First of 
all, top-class specialists, from all over the world, that are currently working 
in this field should be found. Second, the program should both show a 
tutorial character and have specialistic depth. Third: it should address both 
the developments in business and in the academic field, so the developments 
in the universities. Fourth, the relevant architectures should addressed. 
Finally, aspects ranging from application, architecture, algorithm, circuit 
design, and influence of technology trend should be addressed in a balanced 
way. I think we really succeeded in that. 

Most papers address sigma-delta converters, which are clearly the most 
popular ones nowadays in this field; one paper (Patrick Quinn, Xilinx) 
specifically addresses a new concept for algorithmic and pipelined 
converters. The applications range from communication (Kathleen Philips, 
Philips), via broadcast (Andrea Baschirotto, University of Lecce), to sensor 
application (Terri Fiez, Oregon State University). The paper of Ovidiu 
Bajdechi (Delft University, now with Broadcom) focusses on generic and 
systematic design aspects for sigma-delta converters, and another, the paper 
of Libin Yao (Katholieke Universiteit Leuven) discusses sigma-delta in the 
context of the evolution of CMOS processes. 

Arthur van Roermund 
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Abstract 

In a quest for flexibility, receivers for wireless communications 
are being digitized. Analog filters and variable gain amplifiers 
(VGA) are exchanged for digital processing at the expense of a 
very challenging ADC. Wireless interconnectivity in particular, 
seems a first candidate for a full-digital baseband implementation. 
Compared to mobile communication, sensitivity and interferer 
levels are moderate. Two converter solutions for a full-digital 
Bluetooth receiver are presented here. The first -rather 
conventional- ADC achieves high resolution and low power 
consumption by careful design. The second solution merges 
analog filtering and VGA into the ADC. This results in a further 
power reduction because the converter performance is better 
tailored to the input signal. 



1. Bluetooth receiver 

A conventional -highly analog- Bluetooth receiver is depicted in Figure 1. The 
baseband part consists of a cascade of filter and gain sections. The latter limit 
the signal to a pre-defined level. The bits are recovered by further analog 
demodulation (for example using a zero-crossing detector [1] or a frequency 
discriminator [2]) and by consecutive quantization. 




Figure 1 Conventional, dominantly analog Bluetooth receiver 
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Figure 2 presents an alternative concept based on sigma-delta A/D conversion. 
The sigma-delta loop resembles the baseband of Figure 1 in the sense that the 
loop filter takes the place of the cascade of filter and gain sections. The 
quantizer is sampled at a much higher rate mfs (m indicates the over-sample 
factor, fs is the Nyquist sample frequency equaling 2MHz for this low-IF 
Bluetooth receiver). In addition, overall feedback from the output of the 
quantizer to the first analog stage at baseband is applied. The feedback results in 
relaxed dynamic range (DR) requirements on the second, third and higher 
sections of the loop filter of the sigma-delta ADC because their noise and 
distortion is counteracted by the loop operation. Notice, this is similar to the 
conventional baseband: there the DR requirements on the consecutive sections 
decrease because of preceding filtering and (variable) gain. 




Figure 2 Highly digitised Bluetooth receiver with sigma-delta ADC 



Other requirements on the analog sub-blocks are more relaxed for the sigma- 
delta solution than they are for the conventional solution. For example, gain 
accuracy, accuracy of time-constants, offset, etc. are very critical in the 
conventional baseband. Likely, multiple calibration loops need to be 
implemented. On the contrary, the gain accuracy of the loop filter is not 
important for a single-bit sigma-delta ADC and only a moderate accuracy is 
required for the loop filter time-constants. Furthermore, offset in the sigma- 
delta ADC is primarily due to the input stage and the amplification of the offset 
to the output is less than in the conventional baseband. 

The above gives an intuitive reasoning why a high performance sigma-delta 
ADC in combination with digital demodulation can be a low power alternative 
for a conventional analog baseband in combination with analog demodulation. 
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2. High performance SA ADC with complex loop filter 

A single-bit, continuous-time sigma-delta ADC is presented here (Figure 3, [3]). 
At a sample rate of 64MHz it needs a fifth-order complex loop filter to meet the 
required DR. The loop filter realizes high gain within the conversion bandwidth 
from 0 to IMHz resulting in aggressive shaping of the quantization noise. The 
magnitude of the complex transfer function is depicted in Figure 4. Four filter 
notches are distributed evenly over the conversion bandwidth. One notch is put 
at the edge of the image band (i.e. at -IMHz). It -moderately- attenuates the 
noise in the image band such that a certain ‘’leakage” due to finite matching 
between the I- and the Q-path is acceptable [4]. 
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frequency (MHz) 

Figure 4 Transfer of complex loopfilter 



2.1. Implementation 

The loop filter consists of gmC sections, except for the first stage [5]. The first 
stage is an operational transconductance amplifier (OTA) in a negative feedback 
configuration. It is of utmost importance to the performance of the sigma-delta 
ADC. This is motivated next. 



First of all, the input transconductance gm contributes importantly to the overall 
noise and therefore must be large. Also for linearity reasons a large gm is 
favourable. Based on [ 6 ] the third-order inter-modulation product can be 
calculated as a function of the input signal Vin, the gm and vqt of the input 
transistor and the resistors Rmand Rfb (see Figure 3): 
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For a fixed bias current IM 3 improves quadratically with vqt while it is cubic in 
gm. Hence, the input transistor should be biased near weak-inversion to achieve 
lowest distortion. In that case, the linear input range, i.e. Vgt, decreases but gm 
grows. A larger gm results in a smaller error signal at the virtual ground input. 
Both effects are counteracting but the latter is dominant because of the cubic 
relation. Hence, gjl =10 is chosen (with the bias current I=500 |liA)^ 



^ When biasing the input transistors in weak-inversion vqs should be replaced 
by nkT/q in the formulas. 
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Hence, Vgt should not be too small either to prevent inter-modulation of 
interferers at high frequencies causing spurious components within the 
conversion bandwidth. 



Furthermore, the input stage must provide a virtual ground summing node for 
the input current and the feedback current. The virtual ground node must be 
guaranteed over a wide bandwidth. In this particular receiver, the ADC is not 
preceded by any channel filter. Therefore, the bandwidth of the overall input 
signal, including interferers, can be large. It may extend over the entire 
Bluetooth operation band of ~80MHz. (For clarity, the conversion bandwidth of 
the ADC only corresponds to a single IMHz wanted channel.) Also, the 
feedback signal contains a lot of energy at high frequencies. The bandwidth of 
the input stage must be large compared to that of the input signal and that of the 
feedback signal. This favors the use of single stage solutions such as a telescopic 
or a folded cascode topology. 

A folded cascode stage can accommodate a large output swing for the 
integration while a telescopic cascode only takes a minimum number of current 
branches. Because the first stage dominates in the overall current consumption 
(due to the high g„) the latter argument prevails. This choice is a key reason for 
the power efficiency of the presented ADC. 

Table 1 gives a further comparison of some candidate topologies for the first 
stage. The telescopic cascode features a large bandwidth and a minimum bias 
current for a target gn,. In addition, it has a minimum number of noise sources. 
The telescopie cascode is likely to be somewhat less linear because of a tighter 
output range. The dominant non-linearity, i.e. the input transconductance, is the 
same as in the other topologies though. 



Table 1 Circuit topologies for the input stage 





2-stage 


Folded cascode 


Telescopic cascode 


BW 


- 


+ 


+ 


Ibias (for fixed g^) 


- 


- 


+ 


Noise 


- 


— 




Distortion 


+ 


-f 


- 
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Figure 5.a shows the schematic of the first integrator in the loop filter. All 
sources are cascoded to improve the output impedance and to decrease the 
influence of its non-linearity. The n-MOS current source is degenerated with 
transistors in the triode region providing the output common mode control. The 
control relies on matching with a replica bias circuit. The input common mode is 
set by the DAC output. Noticeably, a differential output swing of O.SVpp is 
allowed even though a lot of devices are stacked in the circuit. 




(a) (b) 

Figure 5 Implementation of the first (a) and the following integrators (b) 



Basically, the second and higher sections of the loop filter consist of down- 
scaled copies of the first stage (Figure 5.b). Because their noise and distortion is 
suppressed by the preceding gain in the loop they can be biased at a 10 times 
lower current. Contrary to the first stage the input of these OTAs is not a virtual 
ground node. Instead, there can be a large signal swing on the input node since it 
is connected to the output of the previous integrator. The transconductances are 
degenerated to increase their linear input range. As a consequence, all time- 
constants of the loop filter are set by RC-products and therefore match well. 

The feed-forward coefficients are based on degenerated differential pairs. These 
feed their output current directly into a current-mode latch that also serves as a 
summing node [7]. 
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The single-bit output code is fed back to the input by a resistive DAC: the 
resistor Rfb is connected to a positive or a negative reference voltage depending 
on the output code. A retum-to-zero scheme is applied to reduce inter-symbol- 
interference. 



2.2. Measurement results 

Figure 6 shows the measured SNR as a function of the signal level. The latter is 
normalized to the full-scale of the digital output code: at a 70% modulation 
depth of the digital output (i.e. -3dB digital full-scale) a peak-SNR of 75.5dB is 
achieved. 



CD 

Pi 

00 




output signal level (dB) 

Figure 6 SNR as a function of the output level 



Figure 7 shows the output spectrum of a two-tone test: two input signals are 
applied at -9dB, their IM 3 products are smaller by 82dB. 





280 



m 

w 

Ok 

s 

< 



IM 3 < -82dB, 

DC offset 




fl.f2 

2f,-f2 



2 4 

frequency (MHz) 



Figure 7 Output spectrum for IM 3 test 



Finally, Table 2 lists some conventional ADC performance metrics. 



Table 2 Performance metrics 



input signal 


0-1 MHz, 0.35 V„ns(per channel) 


sample frequency 


64 MHz 


dynamic range 


76 dB 


SINAD 


75.5 dB 


IM 3 


<-82 dBc 


aliasing spurious 


<-75 dB 


power consumption 


4.4mW@1.8V 


die area 


0.22 mm^ 



Since the ADC is used without preceding filter also the performance with 
respect to interferers must be evaluated. As the interferer channels grow they 
cause over-load of the DAC and the quantization noise rises. The allowed 
interferer level is frequency dependent because the signal transfer function 
(STF) from the input to the output of the ADC is so. Moreover, due to the 
complex loop filter the STF -and therefore the allowed interferer level- is 
asymmetrical around DC. This is illustrated with the measurement results in 
Figure 8 . 
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Bluetooth mask: | | frequency (MHz) 

Figure 8 Allowed input level over frequency such that the DR in [0; IMHz] does 
not decrease 



Furthermore, Figure 9 shows the results of an interferer test: an input signal near 
the clock frequency is applied. It is attenuated in the continuous-time loop filter 
before being sampled in the quantizer. The remaining aliasing is counteracted by 
the loop operation such that it appears at -75dB. (The input level of the 
interferer corresponds to the level specified by the Bluetooth standard.) 




Figure 9 Anti-aliasing test with an input signal at -7dB (complying to a 
Bluetooth test) and 370kHz offset from the clock frequency 
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2.3. Evaluation of performance/power 

The above ADC achieves a high DR and excellent linearity while consuming 
only 4.4mW. This is state-of the-art when comparing to [8] (achieving the same 
DR at a five times higher power consumption) or [9] (achieving 12dB less DR at 
only twice less current). 

Still, it does not necessarily provide the most low-power baseband architecture 
for the receiver. This is intuitively understood from the example of the input 
spectra of Figure 10. Suppose the wanted signal is small but a large interferer is 
present (Figure lO.a). The full-scale input level of the ADC must be designed to 
accommodate the large interferer while the noise must remain low compared to 
the wanted signal. Alternatively, suppose the wanted signal is strong (Figure 
lO.b). It is converted with a SINAD of 75.5dB while only some 18dB is required 
for demodulation. In both examples, part of the DR is ‘’wasted” for the 
interferers. 



ADC input 

(a) 



UL 



ADC output 
full-scale J_, 



freq 



31 . 



freq 



(b) 



SINAD 



freq 




□ wanted chaimel 
■ interferer channel 

Figure 10 Example input spectra and corresponding output 



The ADC that is presented next merges some explicit analog filtering and 
variable gain control into the sigma-delta loop. By consequence, the input range 
is better tailored to the expected spectrum of the input signal including 
interferers. This results in lower overall power consumption. 
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3. Filtering sigma-delta ADC 

A low-pass filter Hlpf(s) and a compensating high-pass filter Hhpf(s) are added 
to a conventional sigma-delta ADC as depicted in Figure 11, [10]. The sum of 
these filters equals 1 over the entire frequency range such that the stability and 
the noise shaping of the loop remain unaltered from that of the conventional 
ADC. The STF of the new topology is different though: 

^'^^filterADC (^) “ ^'^^conv .ADC (®) ■ ^LPF (s) (3) 




Figure 11 Concept of the sigma-delta ADC with explicit filtering 

The STF features explicit low-pass filtering. The -3dB-frequency of Hlpf(s) is 
chosen such that interferers are attenuated towards the output^. This is shown in 
Figure 12 for a filtering ADC based on a conventional design with a fourth order 
loop filter, to which a first-order low-pass filter and the compensating high-pass 
filter are added. As a consequence, interferers can be applied at a higher level 
than in the conventional ADC without over-loading the DAC. The maximum 
allowed input level -without increasing the noise in the wanted channel- equals 
the full-scale level for the wanted channel divided by the STF of the filtering 
ADC. Hence, it is inversely proportional to the curve of Figure 12. This will be 
illustrated with measurements later on. Notice, a fourth-order, real loop filter is 
used instead of the fifth-order complex filter for the previous ADC. Again, this 
is commented further on. 



^ Only for nearby frequencies the attenuation is limited. This is due to the 
overshoot in the STF of the conventional ADC that is taken as a starting point. 
In case a less aggressive loop filter were chosen the overshoot would be less. 
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Figure 12 Signal transfer function of the filtering ADC, of the conventional ADC 
and of the first-order low-pass filter 



3.1. Implementation 

The implementation of the loop filter is similar to that of the previous ADC. In 
fact, the same building blocks are re-used and are scaled were appropriate. 

A simple first-order low-pass filter with a -3dB-frequency of 3MHz and the 
compensating high-pass filter are added. A passive implementation of Hlpf(s) 
becomes possible when shifted behind the first integrator stage (Figure 13). The 
first integrator stage must then be duplicated in the compensating path. There, 
the series configuration of Hhpf(s) and the integrator results in a low-pass filter 
allowing an easy implementation. 
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Figure 13 Block diagram of filtering sigma-delta ADC 



The added filters hardly increase the overall power consumption of the ADC. 
The noise and distortion of the passive filter is suppressed by the preceding gain 
of the first integrator^. The same is true for the OTA in the feedback path. 
Hence, this active stage can be biased at only 50pA. Only the first integrator 
remains critical for the same reasons as discussed for the complex ADC in the 
previous section. 

The filtering STF of this ADC is optimally exploited in combination with 
variable gain control of the input signal. The input resistance is switched 
between IkD, lOkO and lOOkO depending on the amplitude of the input signal. 
As such, the input range for wanted signals is scaled from 5mV to 50mV and to 
500mV. The interferer signal can be larger by at least 20dB per decade of 
frequency offset firom the -3dB-frequency of Hlpf(s). The characteristic of the 
allowed input level over fi’equency (i.e. the inverse of the curve in Figure 12) 
must be tailored such that it optimally accommodates the entire input spectrum. 
This is achieved by the choice of Hlpf(s) and -to a lesser extent- by the design of 
the STF of the conventional ADC. 



^ Notice, this would not have been the case if the passive filter were preceding 
the ADC. Then, it would contribute in the overall noise. 
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The bias current of the first stage is adapted dynamically to the input range in 
order to scale this dominant noise source. The bias current equals 500pA for the 
smallest input range and 200p,A otherwise. The lower limit on the bias current is 
set by a bandwidth limitation. The dynamic biasing lowers the average power 
consumption. 

Notice, a fourth-order, real loop filter is used instead of a fifth-order complex 
filter in the previous section. Because of the integration of the filter and VGA a 
reduced output SINAD is allowed while the input-referred DR remains large. 
Hence, the noise shaping can be relaxed. (Potentially, also the sample rate can be 
reduced. In this design, it remains 64MHz though.) By consequence, the 
decimation filter becomes simpler as well. Moreover, the accuracy of the DAC 
references can be relaxed because the clock jitter and the noise on voltage or 
current references is related to the output SINAD instead of the much larger DR 
of the input signal. This results in a major power reduction in all these circuits. 

As a summary, the integration of a filter in the sigma-delta loop in combination 
with variable gain control of the input signal and adaptive biasing results in a 
lower average power consumption of this ADC compared to the ADC of the 
previous section. Its peak consumption has hardly increased. In addition, the 
power consumption in the decimation filter and all DAC references decreases. 



3.2. Measurements results 

The filtering behavior of the ADC is demonstrated in the three-tone test of 
Figure 14: a small wanted signal of 5mV at fi=700kHz is applied as well as 
interferers of 30mV and 120mV at f2=4.8MHz and f3=10MHz respectively. The 
input resistance is switched to IkQ such that the wanted signal appears at -3dB 
of the digital full-scale output. The interferers though, are attenuated to -9dB 
and -lOdB respectively. Within the conversion bandwidth of the ADC the noise 
increases by only IdB due to spurious components from the generator. The 
inter-modulation tone at is at -58dB. 
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frequency (Hz) 

Figure 14 Three-tone measurement proving filtering transfer and linearity 
(fi=700kHzj2=48MHz andfs^lOMHz) 

Figure 15 plots the measured allowed input level over frequency such that noise 
and spurious do not increase within the conversion bandwidth. The 
measurements are shown for Rin^lkQ: especially in the case of a small wanted 
signal the interferers are likely to be larger. The allowed interferer level is 
compared to that for a conventional (i.e. without Hlpf(s) and Hhpf(s)), fourth- 
order sigma-delta ADC in order to demonstrate the improvement. For the other 
settings, i.e. Rin=10kf2 and Rin^lOOkD, the allowed level of nearby interferers 




Figure 15 Allowed input level over frequency such that DR in the wanted 
channel is not jeopardized (for gain setting Rj„=lkT2) 
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follows the same curve; only, the absolute value is larger by a factor 10 and 100 
respectively. Reliability issues proper to the technology limit the allowed level 
of far-off interferers. 

Figure 16 shows the measured signal-to-noise (SNR) and the signal-to-noise- 
and-distortion (SINAD) as a function of the input level for the three modes. An 
overall, input-referred dynamic range of 89dB is achieved. The SNR and 
SINAD at the output are moderate because of the integrated filtering and 
variable gain control. Hence, this filtering ADC constitutes a true equivalent of 
the conventional cascade of analog filter, VGA and ADC in the sense that it 
reduces the signal dynamic range from input to output. The moderate SINAD at 
the output of the ADC results in a low-power ADC (see Table 3), a high 
tolerance of noise and jitter on the DAC references and relaxed attenuation 
demands on the decimation filter. This will be demonstrated in the next 
paragraph. 




Figure 16 Measured SNR (SINAD) as a function of the signal level 



Notice that the peak-SNR and the DR differ per gain setting (see Table 3). This 
is due to the fact that the relative contribution of thermal noise -compared to 
quantization noise- is different for the various gain settings. For Rin^lOOkQ 
quantization noise is dominant. For Rin=lkD the input signal is 100 times 
smaller while the bias current of the first stage is only 2.5 times larger. As a 
consequence, the thermal noise power associated with this stage is significantly 
larger than the quantization noise. As a second order effect, also the influence of 
the other stages of the loop filter becomes more important: as Rin is smaller their 
input-referred contribution scales as well. The quantization noise though. 
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remains the same for all gain settings because the loop parameters (i.e. the time- 
constants) are unaltered. 

4. Evaluation of performance/power 

Table 3 summarizes the measured performance of the ADC for the three gain 
settings. These numbers only relate to the performance with respect to the 
wanted channel. In addition to that, the performance in the presence of 
interferers has been evaluated: the filtering STF provides immunity to interferers 
above the full-scale level for wanted signals as characterized in Figure 8. The 
combination with passive, programmable gain enables an input referred, overall 
DR of 89dB for the wanted signal while only consuming 2mW (or less 
depending on the gain setting). As discussed, the DR for interferers is even 
larger. 



Table 3 Performance of the filtering ADC for the three gain settings 



Rin 


VlN,max 


DR 


SINAD 


P(@1.8V) 


100 kD 


500 mV 


65 dB 


59 dB 


1.5 mW 


10 kQ 


50 mV 


59 dB 


57 dB 


1.5 mW 


1 kn 


5 mV 


49 dB 


46 dB 


2 mW 



5. Comparison of the ADCs in terms of performance/power 

The presented ADCs target the same application. Moreover, they largely consist 
of the same circuits and are implemented in the same 0.1 8pm CMOS 
technology. Hence, a comparison of the performance/power between both is 
straightforward. It is listed in Table 4. Since quadrature conversion is targeted 
the numbers in the right-hand column of Table 4 refer to the power, area and 
performance of two filtering ADCs together. At a comparable area and power 
consumption the solution with the filtering ADCs achieves a 16dB higher DR 
(input-referred) and a better tolerance to noise on the DAC references. 

The SINAD provided at the output of the filtering ADC is much smaller because 
of the integrated filtering and VGA. It is still by far sufficient though for the 
demodulation of a Bluetooth signal. This moderate SINAD is intentional and 
results in a major power saving in the ADC and on a system level. First of all, 
the requirements on the consecutive decimation filter can be relaxed: only a 
moderate attenuation of quantization noise and interferers is needed. This 
translates in a lower power and area for this block. More important even, noise 
and jitter on the references (i.e. voltage/current/charge and time) for the feed 
back DAC become less critical since the DAC only needs a moderate accuracy. 
The last row of Table 4 indicates a major improvement with respect to noise and 
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jitter. Especially for the clock generation this again results in significant power 
saving. 



Table 4 Comparison of conventional complex ADC and filtering ADC with 
merged VGA 





Complex ADC 


2 filtering ADCs 


p 


4.4 mW 


<4.1 mW 


die area 


0.22 mm^ 


0.28 mm^ 


DR 


76 dB 


92 dB 


SINAD 


75.5dB 


>46 dB 


allowed noise and jitter on DAC 
references 


< 0.02 % 


< 0.4 % 



6. Conclusions 

Two ADCs have been presented. The first one combines a high SINAD with low 
power consumption. These features result from the aggressive loop filter and the 
low power circuit design. The latter is achieved by biasing the critical 
transconductors near weak-inversion and by using a minimum number of current 
branches. 

In the second ADC some VGA and filtering of interferers are merged into the 
sigma-delta loop. This architectural innovation yields 16dB more DR and an 
overall power saving. The power saving results from a lower average power 
consumption in the ADC and from relaxed requirements on peripheral circuits. 
The technique is particularly interesting in wireless applications: these typically 
require a large input-referred DR while only a small SESfAD is needed for 
demodulation. 

Although the first ADC already achieves an excellent power/performance ratio 
the comparison proves that by tailoring the ADC to the specific needs of the 
application a further, major improvement is possible. While circuit level 
optimization improves the power/performance ratio of ADCs in an evolutionary 
way architectural innovation may yield a leap step. 
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Abstract 

This paper describes the systematic design of E A analog- to-digital 
converters (ADC), from the top level of abstraction represented 
by the filters defining signal and noise transfer functions, passing 
through the architecture-level, where topology-related performance 
is calculated and simulated, and finally down to circuit parameters. 

The systematic approach allows the evaluation of different loop fil- 
ters and quantizer resolutions, mapped on single-loop or cascaded 
topologies with both discrete- and continuous-time loop filters. 

1. Spectral Noise Shaping 

Analog signals are continuous both in time and amplitude and their spectmm 
contains non-zero tones in a finite frequency band as an effect of their continu- 
ity in amplitude. In Fig. 1 the input signal spectrum is placed around DC (the 
void trapezoid) and extends to fb in the positive spectrum. The analog-to-digital 
conversion requires the analog input signal to firstly be sampled by a sample- 
and-hold (the block S/H) which transforms it into an analog, discrete-time sig- 
nal, only changing its amplitude at periodic intervals Tg (set by the sampling 
frequency /^, Tg = l//s). Because sampling introduces instantaneous amplitude 




Figure 1: Noise shaping ADC 
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changes in the analog signal, the spectrum of the sampled signal has infinite band- 
width, by replicating the input signal spectrum around the multiples of the sam- 
pling frequency (shaded spectrum around fs). The sampled signal is quantized by 
the ADC, being quantized into a discrete-time, discrete-amplitude (digital) form. 
From a ’’spectral” point of view, the quantization introduces quantization noise in 
the signal bandwidth which limits the conversion resolution (as the ratio between 
a reference level and the minimal signal that can be converted). To reduce the 
in-band noise power, the ADC operates faster than 2/^, as an oversampled ADC. 
In this case, the oversampling ratio (OSR) is a design parameter showing how 
many times is fs larger than the minimal value required by the Nyquist theorem 

OSR = A (1) 

^Jb 

With oversampling, only a part of the quantization noise power falls in the signal 
bandwidth. Furthermore, for large OSRs, the S/H block is not really required 
since the input signal changes little during one Tg period. Given enough OSR, 
the ADC resolution n can be eventually reduced to 1 bit, while attaining the final 
target resolution N (after the decimation filter). 

To reduce the required OSR needed for a conversion resolution N, keeping a 
constant number of ADC bits n, noise shaping is applied. This is accomplished 
by high-pass filtering the quantization noise to displace most of its power from 
low frequencies where the input signal spectrum is placed to higher frequencies 
close to /s/2, as shown in Fig. 1 by the bell-shaped shade centered on /s/2. 
The amount of quantization noise power still left inside the signal bandwidth Qn 
depends on the exact filtering applied in terms of filter order and cutoff frequency. 

One method to attain a high-pass noise filtering is to implement a sigma-delta 
(S A) loop around the quantizer, with a loop filter setting the noise shaping. S A 
ADCs are well-studied and versatile architectures which only miss one important 
feature: a highly-accurate analytical model. 

2. Filter-Level Analysis 

2.1. Linear Model. Transfer Functions 

A generic discrete-time representation of a E A ADC, given in Fig. 2, is best used 
to explain the functioning of the E A ADC modeled as a linear system. The loop 
filter has two sections, a forward filter G(z) and a feedback filter H(z). The input 
signal X(z) is applied and compared with the signal fed back by H(z), filtered 
through G(z) and quantized to give the digital output. The quantization introduces 
an error E(Z) which is modeled as input-signal-independent and directly added 
to the output, in the quantizer (represented as a summation point). 
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Figure 2: Noise shaper 

The two EA transfer functions are defined on the system above: the signal 
transfer function to characterize the transfer from X(z) to Y(z) and the noise 
transfer function for the partial contribution of E(z) in Y(z) 

Y(z) = STF{z)X{z) + NTF{z)E{z) (2) 

These functions can be independently defined because a linear model for the 
quantizer is assumed, thus making the whole system a linear one where super- 
position rules apply. The definitions of the two transfer functions are based on 
the two sections of the loop filter: 



Ql^TZ('y\ G{z) 

D1 r yZ) — 



NTF{z) — 



( 3 ) 



The STF can be approximated with 1 (one) at frequencies where G(z) is large 
and H(z) is unity, while, at the same frequencies, NTF can be approximated with 
0 (zero) [1]. Note that, if the filters are implemented using integrators, the fre- 
quency band where G(z) is large is around DC (see Fig. 3). If resonators are used 
instead of integrators, the central frequency for the region of interest is shifted 
around the resonance frequency of the resonators. The latter approach allows 
for bandpass converters to be designed. The feedback filter H(z) is usually not 
implemented separately, but as distributed feedbacks into the G(z) [3]. 

Typical shapes for NTF and STF are shown in Figure 3. The full line repre- 
sents the STF while the dashed line is a typical NTF for low-pass S A converters. 
The important parameters of the two curves are shown, for STF the DC value 
and in-band ripple and for NTF the fs/2 gain and out-of-band ripple. It is worth 
noting that the STF gain is dropping out of the band, rejecting high frequency sig- 
nals. When STF is not well controlled it can also show an overshoot just above 
signal band limit, which is tightening the rejection specification for the digital 
filter following the converter [1]. 

The steepness of the NTF curve inside the signal band is given by the order 
of the loop, which is the order of the numerator of the product G(z)H(z) (see 
Fig. 2) multiplied by 20dB/dec. The poles of this product give the zeros of NTF 
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while the zeros of the G(z)H(z) product affect the poles of NTF. Using resonators 
inside the G(z), the NTF zeros can be moved away from DC and spread inside the 
signal bandwidth in a manner which minimizes total quantization noise power. 
This noise power, scaled by the quantization noise spectral density, is represented 
on the graph in Fig. 3 as the shaded triangle marked Pqnoise which gives the hard 
(theoretical) limit for the conversion resolution in the given band, for a given NTF 
and quantization noise total power. 
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Figure 3: NTF and STF amplitude characteristics for a low-pass S A ADC 



A large STF ripple is not convenient for most applications, which do not tol- 
erate frequency-dependent in-band gain. For certain applications the STF phase 
is also important, as is the case with audio applications. The STF out-of-band 
attenuation can decrease the risk of input signal aliasing and relax the specifi- 
cations for the digital decimating filter following the ADC, but is not a design 
goal when signal conditioning is available in front of the ADC. The out-of-band 
NTF magnitude influences the amount of quantization noise re-circulated on the 
loop [1]. A large magnitude around fs/2 will put more noise at the input of the 
quantizer, thus reducing the maximum signal level which can still be processed. 
A high out-of-band NTF ripple makes the NTF shape more sensitive to process- 
induced pole shifts, possibly turning a stable design into a less-stable or even 
unstable implementation. 



2.2. Design of Loop Transfer Functions 

The NTF is the filter defining the resolution and other properties of a S A ADC, 
hence the design process always starts by defining this transfer function. Even 
if different filter families (Butterworth, Chebyshev or elliptic) can be used, an 
NTF filter has to be causal and scaled to yield an impulse-response starting with 
a unity output. The causality is required for a filter mapped to physical systems 
and implies that there cannot be more zeros than poles in the NTF[z), The 
requirement of initial unity impulse-response stems from the architecture of a S A 
ADC, as shown in the previous section. Fig. 2. There is no direct (zero-delay) 
feedback from the quantizer output to the quantizer input. All the feedback is 
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supplied through the loop filter G{z)H{z) which contains at least one delay in 
its forward path. 

The required scaling of the NTF filter introduces a relationship between its 
cutoff (-3dB) frequency and its magnitude at half the sampling frequency /s/2. 
An increased cutoff frequency induces a larger magnitude at /s/2, as shown in 
Fig. 4 for the NTF of a fourth-order SA ADC with Chebyshev poles. The 
full line has the lowest cutoff frequency and the smallest magnitude in the same 
time. It is the least aggressive NTF in the set shown. The legend in the graph 
also shows simulated DR for the three curves designed into single-bit EA ADC 
architectures, and the least aggressive curve offers the lowest DR (66dB) from the 
three. This is caused by a larger in-band (DC to Signal BW limit) quantization 
noise power compared with the more aggressive members of the set. 




Figure 4: Choice of NTF aggressiveness for a fourth order EA ADC 



2.3. Continuous-Time Loop Filters 

Starting from DT loop filters, continuous-time equivalents can also be built. A 
noise shaper having the loop filter implemented with CT circuits still remains a 
sampled system [7], as shown in Fig. 5. Its output Y (s) supplied by the quantizer 
only changes on the edge of a sampling clock signal with period Tg. Even with 
a passive DAC (a pair of resistors for example), the sampled nature of Y (s) 
produces a DAC output D[s) which is sampled with the same Tg. Assuming the 
input signal X (s) is largely oversampled, the E A ADC in Fig. 5 can be designed 
to be equivalent with the DT-based version in Fig. 2. Noting } the inverse 
Laplace transform for CT transfer functions and the inverse .2 transform 

for DT transfer functions, a mapping is performed between the complete transfer 
functions NTF and STF. By designing the entire NTF{s) and STF{s) to show 
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Figure 5: Noise shaper with continuous-time loop filter 



identical impulse responses: 

Z-^{NTF{z)] = C-\NfF{s)}\t=nT. 

Z-^{STF{z)} = C-^{SfF{s)]\t=nT^ 



(4) 



the partial contributions of the input signal and of the quantization noise to the 
quantizer input are identical in the two designs. This opens good comparison 
possibilities between the functioning of the DT and CT equivalent loops because 
the poles and zeros of the entire transfer functions are matched. 

The direct mapping of NTF and STF assumes a constant DAC output during 
one clock cycle. However, retum-to-zero (RTZ) DAC waveforms may be needed 
in high-DR CT designs. RTZ DACs switch from zero to nominal value during 
any clock period and, with such DAC waveforms, the design method has to be 
adapted to consider the Laplace transform of the DAC impulse response. 



2.3.1. Fundamental Performance Limitation Compared to DT Loops 

It has been mentioned before that a mapping of complete NTF and STF from 
realizable DT loop filters to CT equivalents opens the possibility to compare 
the performance of two equivalent implementations of virtually the same design. 
’’Virtually” highlights the fact that an equivalence is only valid when the linear 
model holds in both cases, which is not true with large input signals. Therefore, 
it is important to analyze the performance of the two EA ADCs, with equiva- 
lent DT and CT loop filters respectively, as independent designs and compare 
the results from time-domain simulations which account for the non-linear loop 
transfer. 

The range of usable NTF aggressiveness (as defined by the lower and upper 
limits of peak NTF magnitude yielding a stable NTF) has been found to vary 
with the order of the loop, the number of bits in the quantizer, as well as with 
the nature of the loop filter, DT or CT. This is proven in Fig. 6 for a set of 
fifth-order, single-loop modulators, with the number of quantizer bits ranging 
from 1 to 8. For any number of bits in the quantizer, the CT equivalents cannot 
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be designed as aggressive as the DT designs. Also, the range of usable NTFs 
increases very fast with the number of quantizer bits for DT designs but very 
slow for the CT equivalents. These results are valid for NRZ, CT DAC pulses. 
Independent simulations using RZ DAC pulses proved to yield results closer to 
DT designs, intuitively placing the cause of the effects explained for the NRZ 
designs in the DAC pulse shape. 




Figure 6: PeakNTF magnitude ranges for DT and CT loops ofORDER=5 

2.3.2. Sensitivity to Initial Accuracy of Implemented Coefficients 

One issue which is not present with DT loop design is the initial accuracy of all 
loop coefficients implemented in the final circuit. As opposed to DT loops, where 
all coefficients are implemented as ratios of either capacitors or currents [3] [9] 
[10], in CT loops the typical coefficient implementation is ultimately a resistor- 
capacitor (RC) product [11]. If no calibration is available for the RC product 
[12], the two on-chip passive components show independent spreads from their 
designed values. As a result, all RC products are within 30% in error from their 
nominal value. 

The method to cope with this spread of loop coefficients is based on the as- 
sumption that spread is the same across a processed wafer. Since all the resistors 
and capacitors (of the same type) on the chip have identical initial accuracies, 
the feedback and the forward (and feedforward) paths keep the same gain ratios. 
The shapes of NTF and STF are not affected, but the NTF comer frequency is 
changed so enough margin should be considered for the quantization noise power 
when the high-pass filter is designed. The output of all integrators scales with the 
initial accuracy of the coefficients, so care should be taken that even when the 
loop coefficients are 30% larger than nominal no clipping occurs. 
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3. Architecture-Level Analysis 
3.1. Non-Linear Loop Transfer 

The linear model only yields accurate results if the two signals entering the sys- 
tem and used to define the transfer functions, X(z) and E(z) in Fig. 2, are mutu- 
ally independent. This is true for small input signals, when the quantizer input 
is dominated by re-circulated quantization noise. At high input signals however, 
the signal at the input of the quantizer also contains some of the input signal 
spectrum. 

From the point of view of attainable DR it is obviously more convenient to use 
aggressive NTF filters. There is however a price to pay for the NTF aggressive- 
ness, and this is a reduced overloading level (OVL). A lower-than-OdBR OVL is 
produced by the modulation of the quantization noise with the input signal spec- 
tral components. A more aggressive NTF concentrates more quantization noise 
power around /s/2 for small input signal levels. Hence, at large input signal 
levels, larger spectral components will be found in the quantization noise spec- 
trum around the input signal frequencies, producing faster overloading due to 
increased non-linear SA loop transfer. Fig. 7 shows the variation of OVL as a 
function of the peak NTF magnitude (NTF magnitude at /g/2 for flat out-of-band 
NTF) for a fourth-order single-bit S A ADC. The peak NTF magnitude is used to 
measure its aggressiveness rather than the cutoff frequency in order to highlight 
the effect explained above. The number of bits in the quantizer has to be taken 
into account since it changes the power of the quantization noise and the tonal 
behavior of the quantization noise at large input signals. This curve, connected 
with the DR values shown in Fig. 4, shows the trade-off between the attainable 
DR and peak SNDR (if limited only by OVL). 

The graphs in Figs. 4 and 7 also contain the limits of the usable NTF ag- 




Figure 7: Overloading level variation with NTF aggressiveness 
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gressiveness for a fourth-order, single-bit S A ADC. It is impossible to design a 
stable DA ADC with this architecture if the peak NTF magnitude falls outside 
the shown limits [1]. 



3.2. Performance Metrics 

During the design and characterization of a S A ADC, a number of performance 
metrics are collected from output spectra. Such a spectrum is shown in Fig. 8, 
generated by a fast Fourier transform (FFT) of the digital output of a fourth-order 
EA ADC with a 5 -bit quantizer. The input signal is set close to the OVL, at 
-2dBR. The entire spectrum is scaled to bring the reference level to OdBR. On 
such a graph it can be measured if the noise shaping has the correct order (here, 
80dB/decade outside the signal bandwidth), showing the loop is not overloaded. 
The resolution and the distortion are also measured on this spectrum. 

As with other ADCs, the resolution of a SA ADC is the measure for the 
smallest analog signal which can be converted. Only in the case of EA convert- 
ers, due to the quantization noise particular shape, the definition of the resolution 
does not start from the noise floor level, but rather from the total in-band noise 
power which gives the dynamic range (DR) 

DR = -mog,, Q [dBR] (5) 

with being the input signal bandwidth {Signal BW in Fig. 3) and Y (/) being 
the quantization noise spectral power density, measured as the spectrum of the 
digital stream output with no input signal 

Because of the overloading effect, DR cannot be used to define the effective 
number of bits (ENOB) since the full input scale is not at OdBR. In the same time, 
DR does not explicitly include non-linearity information. To determine ENOB, 




Figure 8: FFT spectrum for the output digital stream 
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the ratio between the input signal power and the power of all the other in-band 
spectral components is calculated as the signal-to-(noise+distortion) ratio SNDR 

/ 1 rf = fin-^^f \ 

SNDR = lOlog,, - / Y{ff + DR [dB] (6) 

\^Jf = Un-Sf J 

with fin the frequency of the single-tone input signal, 6 f 3i small frequency shift 
and Y (/) the spectral power density of the output signal. The SNDR is used to 
approximately calculate the ENOB as 

r. r. (P^ak SNDR) - 1.76 Peak SNDR 

ENOB = (7) 

6.02 6 ^ ^ 

To fully characterize the E A ADC, two other curves are used: the variation of 
DR and SNDR with the input signal amplitude. A large variation of DR with the 
input signal amplitude shows a highly-aggressive NTF or circuit-induced non- 
ideal effects, the differential non-linearity (DNL) can be measured on the SNDR 
curve as local changes of the slope and integral non-linearity (INL) as a variation 
of the average slope from the desired conversion gain. 

3.3. Single Loop Topologies 

Different topological variants can be used to design ’’stable” (or rather ’’realiz- 
able”) SA ADCs. No topology design can stop the non-linear behavior of the 
E A loop to become apparent at high input signals, therefore the overloading level 
is less than OdBR for all implementations. However, some topologies behave bet- 
ter than others at large input signals in terms of OVL value, increasing it by one 
or two dBR. OVL is much tighter connected with the NTF aggressiveness than 
with the E A topology used. Why ’’realizable” is a better term than ’’stable” when 
it comes to characterizing a topology becomes apparent by taking a look at Fig. 
9 which is a typical fourth-order ’’realizable” topology, sometimes praised to be 
the most ’’stable” one due to its feedforward coefficients • • • 63 [5]. Without 
the feedforward coefficients, the only NTF that can be mapped on this topology 
would be ^ 

NTF{z) = (8) 

{z -1) + /iaia2a3a4 

The NTF(z) above offers very little freedom in designing its poles and is practi- 
cally impossible to map on a convenient filter shape like the ones in Fig. 4. 

3.3.1. Mapping of Transfer Functions to Loop Coefficients 

Starting from Fig. 9, the expressions of designable NTF and STF can be calcu- 
lated from Eq. 3 after expressing G(z) and G(z)H(z) as sums of products of loop 




303 




Figure 9: Distributed feedforward topology 



coefficients ai • • • a 4 and b\‘ The filter G(z) is the partial contribution of the 
input signal X(z) to the quantizer input 

b\ai b2ai02 , b^aia20s 01020^04^ 

+ (731)3 + -(73T)i W 

The other filter of interest, G(z)H(z), is 



G{z)H{z) = -fiG{z) 



( 10 ) 



By replacing the two expressions above in Eq. 3, the two transfer functions are 
easily written as NTF" {z — 1) and STF"(z — 1) 



NTF"{z-l) = 
STF'\z - 1) = 



0-1)^ 

E%4nj{z-iy 

T,L.m{z-iy 

E%,nj(z-iy 



( 11 ) 



with the polynomial coefficients calculated from the topology coefficients. The 
NTF and STF are only represented as functions of (z — 1) instead of z because 
they are easier to extract from the topology in this form. As such, they can still 
be mapped to an equivalent form of the filters generated from a set of target poles 
and zeros. 

A complementary topology replaces the three feedforwards with three feed- 
back coefficients, as shown in Fig. 10. The three feedback coefficients / 2 , /a 




Figure 10: Distributed feedback topology 
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and /4 are sufficient to offer control over the pole structure of the NTF and STF, 
hence this topology can also be used for EA ADC designs. Because only three 
feedback coefficients can be added, this architecture still lacks control over the 
STF zeros, independently of the NTF poles. To gain control over STF zeros as 
well, there is obviously need to use a more complex topology by connecting both 
distributed feedbacks and feedforwards. 

Note that, although all four NTF zeros are shown in Eq. 1 1 as being placed 
at DC (z=l), it is possible, with minor architecture changes, to spread the zeros 
inside the signal bandwidth and further reduce the NTF’s in-band mean value, 
thus reducing the total in-band quantization noise power. The use of a resonator 
(shown dashed in Fig. 10, loop coefficient r) introduces a zero in the NTF(z) at 

^in—band ~ \/l ~h Td^CL^ (12) 

Taking advantage of the usually small value of r caused by the small value of 
^in-band (especially at high OSR), the resonator can be completely ignored in the 
first mapping phase and only calculated and added after the integrator coefficients 
have been determined. Multiple resonators can be added to reduce the NTF in- 
band magnitude, the limit of their number being set by the loop order divided by 
two. From the circuit point of view, it is more efficient (both in area and power) 
to add resonators starting from the end of the loop filter. 

The mapping flow is therefore a prioritized list starting with the strict map- 
ping of NTF’s poles, followed by the optional mapping of STF’s zeros (depend- 
ing of design requirements and architecture complexity), and ending with the 
addition of resonators in case more resolution is desired. 



3.3.2. Noise and Distortion Contribution of Individual Integrators 



Conversion performance limits for a low-pass E A converter are set by the noise 
and distortion, which are contributed by all integrators in the EA loop. Noise 
is generated in each integrator by resistors and active devices (MOS transistors). 
Distortion is caused by large-signal non-linear behavior of active devices. The 
weights associated with the noise and distortion contribution of each particular 
integrator in a E A loop depend on the loop gain from the input to the respective 
integrator. 

For the following calculations only the white noise will be considered, gen- 
erated in a bandwidth larger than half the sampling frequency, /s/2. Noting the 
total noise power at the input of each integrator P^m P = 1 • • • 4 (for a fourth 
order loop as shown in Figs. 9 or 10) and knowing the coefficient of integrator i 
is ai, the power P^m can be referred to the input of the converter as 



P — P 

^ wrii 



(rf 

Vi=i “i / ~ 



( 13 ) 
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under the assumption that the signal bandwidth fb is much smaller than fs. 

The distortion power introduced by integrator i, noted for similarity with 
the noise contributions, refers back to the input of the loop by the worst-case 
integrator gain, which is at the end of the input signal bandwidth. Assuming a 
large OSR, the input-referred distortion gets a simplified form 



D, = D. 




7T ^2(z-l) 

osr) 



(14) 



3.4. Cascaded Topologies 

Cascaded EA ADC designs consist of a chain of E A ADC loops, only the first 
loop processing the input signal while the following ones process the quantization 
noise of the previous loop in the chain, respectively. In this way, the cascaded 
E A ADC designs maximize both the DR by using highly-aggressive (high cutoff 
frequency) noise shaping and the OVL by only applying the full input signal to 
a low-order loop placed as the first stage in the cascade. Cascaded designs are 
convenient in applications where a de-coupling between the increase in DR and 
decrease in peak SNDR is wanted at a fixed OSR, as is the case with broadband 
ADCs that reach into technology-limited OSR values. 



3.4.1. Digital Cancellation of Quantization Noise 

One possible topology for a fifth-order, three-loop cascade EA ADC is given in 
Fig. 1 1 . It consists of a first loop, taking the input signal X (z) and generating the 
digital stream Yi{z) which contains a delayed version of X{z) and quantization 
noise Ei{z) shaped by the second-order NTFi. The subsequent loops convert 
the un-shaped quantization noise of the previous loop in cascade. Therefore the 
OVL is only limited by the second-order first loop, a likely larger value than what 
can be obtained with a fifth-order loop. The digital part of the topology, at the 
right of the dashed boundary, cancels all lower-order shaped quantization noise, 
allowing only the fifth-order shaped noise generated in the third loop to still be 
present in the output digital stream Y[z), This is accomplished by combining the 
individual outputs Vi ...3 filtered through digital replicas of the analog NTF\ and 
NT F 2 to generate the output stream (assuming Q 2 and ^3 are 1 ) 

Y{z) = X{z)z-^ + Ez{z)NTFi{z)NTF 2 {z)NTF 3 {z) (15) 

According to this equation built on linear model of the loops in the cascade, only 
the quantization noise generated by the last loop in the cascade and shaped by the 
fifth-order product of all NTFs is present in the output stream, so the topology 
ideally attains the same DR as a fifth-order single-loop E A ADC with a quantizer 
having N3 bits. 
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Figure 11: Fifth-order, 3-loop cascade SA ADC 



Considering the non-OdBR OVL shown by all the loops in the cascade, the 
coefficients di,ci and d 2 , ci are not unity in real designs. The coefficients and 
gz are sized depending of d\, e\, c ?2 and 62 . In straightforward designs (no scaling 
of quantizer input in any loop) d\ = ei and d 2 = C 2 , so 



1 

92 = -r 92,^ 

d\ 



1 

did2 



(16) 



gi is only different from unity if there are differences in the reference level of the 
first and the subsequent loops. 



4. Discrete-Time Circuits 

4.1. Switched-Capacitor Summing Integrator 

The most widely used DT integrator is the switched-capacitor integrator, shown 
in Fig. 12 in its differential, summing version. It consists of an opamp connected 
in closed-loop with the integration capacitors Ci, which are used to accumulate 
charge, thus performing the low-pass function. For half the clock period (sam- 
pling phase) the input signal is sampled on the sampling capacitors Cs by closing 
the switches 5 and keeping the switches i open. During the next half clock period, 
the switches s are open and the switches i are closed, thus connecting Cs to the 
charge-transfer nodes that are the opamp inputs. Because during this integrating 
phase the sampling capacitors Cs are connected between ground and the virtual 
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Ci 




Figure 12: Summing SC integrator with two integration paths 



ground nodes (held at virtual ground by the large opamp gain), all the charge 
stored in Cs is added to the charge of Ci, modifying the output voltage accord- 
ingly. The transfer function can be written in Z-domain, considering 
as 



Hsc{z) = 



Cg Z 

Vin{z) 



-1 



Ci 1 — z 



-1 



(17) 



A similar transfer function can be written from the VR input to the output. Hence 
the integrator coefficient, important for the mapping of the loop filter to the de- 
signed topology, is given by the ratio of the two capacitors. This is convenient 
from the point of view of yield optimization due to a relative mismatch of on-chip 
capacitors as low as 0.1% for a good layout. 

The integrator in Fig. 12 completely separates the integration paths for the 
input signal and for the DAC reference voltage. Furthermore, the switching at the 
VR nodes is signal-independent, and so is the switching at the opamp’s inputs, re- 
ducing harmonic distortion. The only signal-dependent switching is introduced 
by the use of physically different switches at the VR nodes, but this has minimal 
effects due to their connection to low-impedance nodes. The noise power (re- 
ferred to the input of the integrator) introduced inside the signal bandwidth by 
this switching scheme is 



4fcT / Cft 

CsOSR V o'. 



(18) 



The circuit described above implements a DT integrator transfer function only 
if circuit non-idealities are neglected. But even if these non-idealities alter the 
function implemented in the final circuits, their effects on the overall conversion 
performance can be reduced by correct design of circuit components and signals. 
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4.1.1. Finite, Nonlinear Amplifier Gain 



Amplifiers designed in CMOS processes suffer especially from a low value of 
the DC gain caused by reduced output impedance of MOS transistors. This af- 
fects the transfer function of the SC integrator by an incomplete discharge of 
the sampling and DAC capacitors, Cs and Cfb in Fig. 12, during the integration 
phase. The input capacitance of the opamp also becomes important in high-speed 
CMOS processes. With the input capacitance of the opamp Cp, a more accurate 
expression of the integrator’s output at the end of the integration phase can be 
calculated: 



^out 




{VinCs + VnCfb) 



1 Ci-\-Cs-\-Cfi)-\-Cp 

^ Ci 



(l + 



1 Cj-^Cp \ \ 

Adc Ci ) ) 



(19) 



This expression can be placed in time-domain simulations of SA ADCs to 
show the impact of the finite amplifier DC gain on the conversion performance. 
Since Eq. 19 shows that a real integrator introduces both a gain and a pole error 
compared to the ideal integrator in Eq. 17, the finite amplifier DC gain affects the 
NTF characteristic and therefore decreases the DR. 

Traditionally, is said that gain linearity is not important in SC integrators. 
This is true as long as the DC gain of the amplifier is large enough to keep a 
low voltage at its inputs after the integration phase or as long as the gain, even if 
small, does not vary with the output voltage. Both these assumptions tend to be 
difficult to attain in high-speed CMOS processes. 



4.1.2, Finite Amplifier Bandwidth and Slew-Rate 

Switched-capacitor circuits count on fast charge transfer from the sampling and 
DAC capacitors to the integrating capacitor, and so far, this transfer has been 
considered to be much faster than the sampling speed. This is only true if the 
unity-gain frequency of the amplifier (loaded with all the switching capacitors) 
is extremely large compared to the sampling frequency, which is almost never 
the case. In SC circuits, the limited amplifier bandwidth and slew-rate introduce 
errors at each integration and sampling phase, in the form of harmonic distortion. 
In practical circuits the amplifier bandwidth is minimized to reduce power con- 
sumption, and the lower limit is set by the total harmonic distortion which can be 
introduced by the integrator. The distortion introduced by each integrator has to 
be analyzed separately and considering the associated loop gain. 

Settling errors are introduced both during sampling and integrating phases. 
In designs with static biasing and clocked with 50% duty cycle, the capacitive 
load of the amplifier during the integration phase is always larger than the load 
during the sampling phase so the former phase (which constitutes the worst case) 
must be analyzed to determine the settling properties of the integrator. 
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4.2. Power Consumption Analysis 

The transconductance of the opamp inside a SC integrator is a good model of 
the power consumption of the integrator, especially if the amplifier is a one-stage 
design. Starting from the noise budget available for the integrator, one can calcu- 
late the capacitors building the E A coefficients. With the capacitor values and the 
distortion requirements (translated in settling constraints), the gm of the opamp’s 
input stage can be calculated. 



4.2.1. Noise 



The noise budget allocated to each integrator (based on designer expertise or by 
CAD) is consumed by two components: the switch noise {kT/C noise) and the 
opamp noise. For a one-stage amplifier with a large input transconductance g^., 
the noise power depends only on the capacitive load 



2kT^ 



(20) 



with 7 the noise enhancement factor for short-channel transistors [3] (applicable 
for opamp input devices). The total noise power (wide-band) at the input of the 
SC integrator is 



Pn = ^KT 



1 

a 



1 + 



Cfk 

a 



-b 



7 

2Cd 



( 21 ) 



= + + + Q + (22) 

where Ci is a fraction of the integration capacitor (parasitic capacitance of Cd 
connected at the opamp’s output 



Cl = aCi (23) 

with a a technology-dependent percentage. Cp is also proportional to Ci (for a 
given Vga — Vt and UGB): 

Cp = gCi- (24) 

The Cs and C/t capacitors are related to the integration capacitor through the S A 
loop coefficients supplied by the architecture design (see Fig. 9) 

C, = aCu Cfb = afCi (25) 

and using these proportionalities the load capacitance can be written as a function 
of the integration capacitor, technology parameters and SA loop coefficients 

Cd = Ci ((a + af + g) {1 + a) + a) . 



(26) 
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Combining the expression of the noise power with the compact expression of the 
load capacitance, an expression of the integration capacitor as a function of the 
noise budget of the integrator can be written 

^ + 2 ((^-+ a/— 1) (1 + a) ?■«)) ■ 

As part of the total noise budget (allocated for the entire converter), the noise 
power inside the signal bandwidth is referred at the input of the SA converter 
using Eq. 13. 



4.2.2. Distortion 



Starting from a specification of harmonic distortion \HD 2 ,\ (only the third one 
since differential circuits are used), the linearity of the integrator can be expressed 
in bits as [14] 



B 



\HD:,\ [dBc] 



(28) 



The number of bits B is then used to calculate the number of time constants 
required by a single-pole system to settle to B bits [14] 



Nr = Bln{2) 



(29) 



which, for a settling time Tggttie requires an UGB 



UGB = 



Nr 

^settle 



(30) 



The concept of the single-pole system is true for single-stage amplifiers and is 
also a good approximation of the behavior of two-stage amplifiers with enough 
phase margin. 



4.2.3. Required OpAmp TVansconductance 



The Qm is then calculated from the required UGB, as 



9m 



= 2ttCciUGB = 2irCci 



Nr 

Tgettle 



(31) 



For a slewing followed by settling model [3] the time available for settling is 



'^settle — 



Tclk 

2 



'^slew 



(32) 



with Tsiew being the time needed for the opamp to recover from non-linear be- 
havior. Hence, the expression of results in 



_ 2TrCci {Nr - 1 ) 

Tclk _ t /+^ 
2 lo 



( 33 ) 
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where Iq is the biasing current (tail current) of the opamp MOS input pair and 
is the voltage at the input of the opamp at the beginning of the integration 
phase, and after passive charge re-distribution [3]. 

If MOS transistors operated in weak inversion are used in the input stage of 
the opamp, Qm = 10/q, and a compact expression for Qm is obtained 

9m = ^ ((iVr - 1) + lOF/) . (34) 



5. Continuous-Time Circuits 



5.1. Integrator Topology 



A fully-differential active-RC integrator for use in SA ADC design is shown in 
Fig. 13. Apart from the input signal path, Vm with V/I conversion performed 
by Ri, there is a second path which implements the SA DAC, in this case only 
one-bit, with the pair of resistors Rdac. The current from both paths is integrated 
on Ci. The opamp keeps a null voltage difference between its inputs, hence the 
integrated current is 



Ici = 



in 



V 



2R, 



± 



V+. - K 



^ref 



ref 



‘^Rda 



(35) 



The sign of the summation is decided based on the decision of the S A quantizer, 
represented in the schematic by the logic bit BO. If the decision BO is logic-High, 
it means the output of the integrator is too large and Vref is switched to reduce 
the output voltage. 

The noise power at the input of the integrator is given by the input and DAC 
resistors and the Qm of the opamp ’s first stage 



PcTnoise = SkTBW{Ri + ^ 

-^dac 



+ 



^9m 



(36) 



As opposed to SC integrators, there is no noise aliasing since no noise sampling 
is taking place. 




Figure 13: Active RC integrator with DAC resistors 
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The integrator using DAC resistors can be extended for multi-bit DACs by 
adding DAC paths, independently connected to the same Vref. 



5.2. Effects of Circuit Non-Idealities 

In comparison with a switched-capacitor (SC) integrator, the only active element 
in a CT integrator with resistive DAC is the operational amplifier. In such integra- 
tor topologies the opamp is the circuit element limiting integrator’s performance, 
assuming highly linear on-chip passive components are available. 



5.2.1. Finite, Nonlinear Opamp Gain 



The finite gain of the opamp used inside the CT integrator affects the transfer 
function of the integrator, in a similar manner as in the case of SC integrators. Yet 
its effect is different in the CT integrator. The transfer function of the integrator 
with an opamp exhibiting finite DC gain is 






+ sRiQ 



( 37 ) 



for a finite opamp DC gain of Anc- This equation shows that both the DC gain 
of the integrator and its pole are affected by the finite DC gain of the amplifier. 
These changes in integrator transfer functions are expected to cause loss of S A 
conversion performance mainly in terms of resolution, by introducing changes 
in the designed NTF likely to increase quantization noise inside the signal band- 
width. 



5.2.2. Finite, Nonlinear OpAmp Transconductance 

With a voltage input, the active-RC CT integrator still integrates a current. The 
major non-linearity, analyzed here, is introduced by opamp’s finite and strongly 
nonlinear transconductance g^. A simplified model of a single-stage amplifier 
can be used for this analysis, yielding results that can also be applied to more 
complex opamp designs by replacing the in this model with the equivalent 
gm of a multi-stage, correctly-compensated (approximately single-pole behavior) 
amplifier. 



5.2.3. Finite Opamp Bandwidth and Slew-Rate 

Compared to SC integrators, the finite bandwidth of the amplifier in a CT inte- 
grator does not affect the linearity of the integration but the in-band quantization 
noise power by changing the NTF shape. For a single-pole opamp with a unity- 
gain bandwidth of uq and DC voltage gain Ajjc^ the frequency-dependent voltage 
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gain A(s) can be used in Eq. 37 to find the transfer function of a CT integrator 
containing the bandwidth-limited opamp 

_ Jl _ L 



A(s) + ^ (i + 



(38) 



The equation above shows that, in order to have a good approximation of the 
ideal integrator function, 

J^^^ydeal ^ (39) 

SlLiCji 

not only the DC gain A^c should be much larger than 1/ \ sRiCi\, but also the 
unity-gain bandwidth of the opamp should be much higher than the unity-gain 
bandwidth of the integrator 

(40) 



> 



RiCi 



To compare the result in the equation above with the sampling frequency 
fcLK of the SA ADC, the value of the loop coefficient associated with the inte- 
grator should be used 

-^i^ijCLK 

For typical a* values of 0. 1 to 0.3 results that 

> fcLK ■ ■ ■ ^fcLK (42) 



The exact value depends on the overall tolerance to in-band noise leakage and it 
scales with the fill factor for a retum-to-zero (RTZ) DAC shape, that changes the 
1 / RdacCi to (5/ RdacCi, with S the RTZ fill factor. 

The second issue associated with a limited opamp bias current is the slew- 
ing of the amplifier’s output stage. Since the signal processed is the integrated 
current, a limitation of this signal would introduce distortion in the overall A/D 
conversion. Therefore, a hard limit can be found for the biasing current of the 
output stage of the opamp as the peak of the integrated current during nominal 
operation of the SA ADC. This is usually not a very restrictive limit. 



5.3. Power Consumption Analysis 

A good estimate of the CT integrator’s power consumption can be obtained from 
the required opamp transconductance. For each integrator in a SA ADC, the 
input and DAC resistors, as well as the integration capacitor, are calculated from 
the noise budged of the integrator and its associated loop coefficients. The values 
of the passive components are in turn used to calculate the opamp’s gm required to 
keep the distortion lower than the limit allocated to the integrator. The integrator 
is assumed to be a active-RC, fully differential design with both input and DAC 
resistive paths, as shown in Fig. 13. 
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5.3.1. Noise Performance 



For the differential CT integrator in Fig. 13 the noise budget is distributed among 
the two pairs of resistors Ri and Rdac, and the operational amplifier’s transcon- 
ductance Qm, resulting (Eq. 36), for a signal bandwidth BW, 



R} 



7 



PcTnoise = SkTBW{Ri + ^ + ^) 

^dac ^9m 



(43) 



The equation above can be written in a more compact form if the expressions 
connecting the S A loop coefficients with the passive circuit components are used 
(see Fig. 9) 



RiCifcK' ^ 

to write PcTnoise as function of Ci and Ri 



Ri 

Rdac 



(44) 



P OTnoise 



SkTBW 

aCifcK 



1 + / + 



7 

^9mRi 



(45) 



To quickly estimate Ci, the last term of the equation above can be ignored or 
be allocated a (small) fraction of the noise budget. Knowing Ci, the resistors 
can be calculated and used to calculate Qm from the distortion specification. Al- 
ternatively, an optimized set of circuit parameters can be calculated if first the 
distortion specification is used to extract an expression for the gmRi product, free 
from other circuit parameters. 



5.3.2. Distortion Performance 



Assuming the opamp has a large gm [2], third-order harmonic distortion is given 
by the amplitude of the input signal Va, the nonlinear component of the opamp 
transconductance g^, and the equivalent resistor connected at the input of the 
opamp, R, 



R 



1 

= -^ + 



HDic 



VI 



(46) 



Ri ' Rdac' " 24g^R^ 

the distortion results to only depend on the non-linear component of the opamp ’s 
transconductance, gs. The expression of is calculated as the third derivative 
of the residual input voltage which, for a MOS input pair with transistors sized 
WjL in strong inversion, is 



ySI 

y j. 



2KpW/L (\A^ it 



while, for weak inversion transistors, it is 

= uVrln 



h + R 



(47) 



h-R 



(48) 
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Derivatives of these expressions yield the transconductance components gm and 
^ 3 . Considering the expression of R, the third harmonic distortion becomes, for 
strong inversion transistors, 



HDl^ 



vlv^t (. , Ri 

32RfIi \ ^ Rdac 



and, respectively, for weak inversion. 



hdY^ 



VlnVr (. , 

V ^ Raac 



(49) 



(50) 



5.3.3. Required OpAmp Transconductance 

From the equations above and using Eqs. 44, a compact expression of the re- 
quired gm for transistors in strong inversion 



„si ^ i_ . M(i + /) 

Rt\jZ2V},HD^ 

and, respectively, for weak inversion 



wi _ J_ 3/ (J- + /) 

Ri V 48n2l/y2^Z?3 



(51) 



(52) 



Both expressions yield a value for the product gmRi which is free of other design 
parameters 



K, 



9m — 



gm 



Ri 



(53) 



The factor Kgm can be calculated before the optimization of each integrator takes 
place, from SA loop coefficients, performance requirements, MOS transistor 
parameters, and a design decision for Vgt when strong inversion is used. 

With Kgm, the integration capacitor C{ can be calculated from the noise bud- 



get 

_ SkTBW / ^ ^ ^ 7 

O-PcTnoisefcK \ ^Kgm 



(54) 



and, using Eq. 44, the resistors Ri and R^ac are calculated. With Ri, gm is 
calculated from Eq. 53 or directly from Eqs. 5 1 and 52. 



6. Computer-Aided Design of Sigma-Delta ADCs 

Since no accurate analytical model is available for EA ADCs in general, there is 
need to simulate different possible solutions before deciding which one has the 
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potential to be the optimal one. This Section presents a systematic procedure to 
exhaustively explore the design space of E A ADCs, by automatically designing 
and evaluating a large number of candidate solutions based on behavioral sim- 
ulations. Two design examples are also shown, to illustrate the advantages and 
limitations of this procedure. 



6.1. Filter-Level Design 

A fast design space exploration is performed at filter-level, using a linearized 
model of the E A ADC. The goal of this search is to quickly evaluate all solutions 
in the design space and select only candidates that can attain the target dynamic 
range (DR). The search algorithm is shown in Fig. 14. The entire SA ADC 
design space is split in two sub-spaces, the topology sub-space and the parameter 
sub-space. The topology design sub-space is defined by the ORDER, number of 
cascaded LOOPS and number of BITS, while the parameter design sub-space is 
defined by the oversampling ratio OSR and peak NTF (noise transfer function) 
magnitude. Multiple parameter sub-space elements can be associated with every 
topology sub-space element, changing the performance of the final solution while 
keeping a fixed physical implementation. All dimensions of the search space are 
browsed using constant stepping, linearly in topology space and exponentially in 
parameter space. 

At each step a set of two filter transfer functions are generated, the NTF and 
the signal transfer function (STF). Filter generation can also optimize in-band 




Figure 14: Filter-level exploration algorithm 
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zeros for the NTF (to reduce total in-band noise power [4]) and off-band zeros 
for the STF (to reduce in-band gain ripple and increase off-band rejection). After 
the two filter transfer functions are generated the in-band noise power is estimated 
to find the DR. A test is applied with two thresholds derived from the target DR 
(in dB) 

^ ^target 4“ ^ Slower ^ DR ^ D Rfdj^g^f -|- DR>npp^j‘ (55) 

The lower limit tests if, after the circuit (white) noise is added, the converter 
still reaches the final target DR. The upper limit is set to reject power-hungry 
solutions which offer more quantization-noise DR than actually needed. If the 
test is passed, the solution is saved in a database. 

For single-loop solutions the algorithm stops when a combination of mini- 
mal OSR and NTF is found. For cascaded designs however, because significant 
architectural details are not available at the filter level, the algorithm continues 
searching for solutions even after finding the first valid one. This insures that no 
valid solution is prematurely rejected. 

The limits within which NTF is browsed are decided based on a database 
stored on disk containing, for each set of (ORDER, BITS), the value of the over- 
loading level. This database is built by applying the entire design procedure ex- 
plained in this section (both filter-level and architecture-level design) to the entire 
topology design sub-space with the LOOPS constrained to 1 (OVL of cascaded 
solution is ultimately decided by the a single loop). In the parameter sub-space, 
NTF is sampled in a wide range while OSR is set to 64, because OSR does not 
affect the overloading level. The use of the NTF database results drastically 
reduces run-time because the solutions considered as possible candidates are ac- 
tually guaranteed to yield stable E A loops. 

6.1.1. NTF Database 

The one performance parameter of E A ADCs that cannot be predicted accurately 
by linear modeling is the overloading level (OVL) [1]. In the algorithm presented 
here, the overloading levels for each single-loop EA ADC are computed once 
from time-domain simulations and stored in a database. The peak magnitude 
of NTF is varied in a wide range. Each single-loop E A ADC is designed and 
optimized in a range of input signal values for each NTF peak magnitude. The 
overloading level is detected as the input signal which causes integrator clipping. 

Fig. 15 shows the entry in the peak NTF database for the single-bit, fourth- 
order E A ADC. It shows the variation of overloading level OVL, peak output of 
the last integrator Out4 and the peak SNDR in the NTF range where the loop can 
be stabilized. The decision that the loop can be stabilized is taken by comparing 
the values of integrator coefficients against a bottom value. If all integrator co- 
efficients are larger than the bottom value and integrator outputs are smaller than 
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the Vciip value, the loop can be designed stable and there exists a dB-negative 
overloading level. 

Because of this tight relationship between the OVL value and the ratio of the 
clipping voltage over reference voltage, the database has to be generated for each 
different ratio of VcUplVnef. The same database can be used for any target DR 
(SNDR) design as long as the VciipjVRef ratio does not change. 




Figure 15: Peak NTF magnitude database entry example 



6.1.2. Dynamic Range Estimation 



A fast yet accurate method to estimate the DR is used. It overcomes the draw- 
backs of both time-domain simulations, which are slow, and classical DR esti- 
mation using a formula which is inaccurate especially for high-order E A loops 
[ 1 ]: 

DR = - 1)2 (56) 

First the magnitude of NTF is calculated in n points equally spaced from DC 
to fs/2 from its polynomial form, in Z domain. Considering the total power of 
quantization noise [1] if the quantizer step is A, the (white) quantization noise 
amplitude in each bin i from DC to fs/2 (from 0 to n), referred to Vref = 
A(2^-i)/2, is 



Sqn,Bbit — 20 / 0^10 




- 20(5 - l)logio{2) [dBR] 



(57) 



if the quantizer has B bits. 

The value of the NTF magnitude, expressed in dB, is added to the value of 
Sqn,Bbit ^ud a curvc showing the quantization noise amplitude in each bin is 
drawn. Its integral in the band of interest yields the estimated DR value. Fig. 
16 shows the estimated (left) and time-domain simulated {right) spectrum of the 
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Figure 16: Estimated (left) vs. time-domain simulated fright) noise shaping 

output of a fourth-order, one-bit, single-loop SA ADC. The two graphs show 
the good accuracy of this estimation method for high-order loops. An extended 
comparison between the classical formula and the polynomial estimation method 
applied to single-loop EA ADCs shows that the estimation error is always less 
than 4dB, as opposed to the error yielded by calculating the DR using the clas- 
sical formula which can be comparable to the target DR itself. The polynomial 
estimation method also works better than the calculation for cascaded loops. 

6,2. Architecture-Level Design 

The architecture-level exploration algorithm evaluates the performance of the 
filter-level solutions mapped on a specific architecture, as shown in Fig. 17. If 
a single-loop solution is processed, an architecture is generated with the feedfor- 
ward and feedback connectivity specified by the user. An initial set of coefficients 
are then calculated and simulations are performed with a wide range of input sig- 
nal amplitudes to determine the OVL. Because the initial set of coefficients are 
not optimized they can cause premature overloading if integrator clipping to the 
supply rails is simulated. Therefore, for these simulations the clipping voltage 
is considered to be many times larger than the actual value. Since the reference 
voltage remains the same, this simulation setup is equivalent to using no-clipping 
integrator outputs. Single-tone input signals produce frequency-dependent OVL 
values, so the input signal applied at this design stage is a pulse, a good approxi- 
mation of a busy signal [1] which produces a frequency-independent OVL. 

The next step is the coefficient optimization, performed at the OVL previ- 
ously detected by observing the peak output of each integrator and by correcting 
the associated loop coefficient accordingly. The SNDR and DR variations as a 
function of the input signal are then simulated using a sine input signal. The 
two curves are tested for performance and passing solutions are saved for further 
processing. 





Figure 17: Architecture-level design space exploration 

For the cascaded E A ADCs the only architectural details predefined at filter 
level are the number of loops and the number of bits in the last loop. Therefore, 
the filter orders of individual cascaded loops are generated as one additional de- 
sign space dimension named ORDERS in Fig. 17. Another new design space 
dimension is the number of BITS in the first loop of the cascade. For simplicity, 
the last n — 1 loops in a cascade of n loops have the same number of bits as the 
last one. Each derivative of the input solution in the extended design space is 
analyzed as a possible solution. The architectural derivative is built starting from 
the parameters supplied in the filter-level solution, namely the peak NTF and the 
number of quantizer bits for the last loop in the cascade. The peak NTF in the 
filter-level solution is the product of peak NTF values in each loop. 

The generation of a cascaded design architecture consists of finding the \NTF\ 
value that yields stable NTFs for all the loops in the cascade, considering their in- 
dividual order and quantizer bits. The individual orders and quantizer bits are set 
at this stage by the filter-level solution and the ORDERS and first quantizer BITS 
design parameters, so the range of common stable NTFs is found by performing 
a logical-AND between the stable ranges of each loop in the cascade, as extracted 
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from the peak NTF database. There are cases when the logical-AND yields a null 
range and the architectural derivative is then dropped. Each individual loop in 
the cascade is then designed following the procedure described for single-loop 
solutions. The coefficients connecting the loops are derived from corresponding 
loop coefficients [6]. 



6.2.1. Performance Test 

Performance testing is based on statistics of SNDR and DR curves as functions of 
input signal level. Linear regressions are performed on each curve, from SNDR 
zero-crossing to the overloading level. The slope of SNDR is tested to be close 
enough of the desired conversion gain (typically unity). A slope outside this 
range shows a strong dependency of quantization noise power of the input signal 
level, which is not desired. The intercept of the DR curve is then tested against 
the target DR value to insure the target DR is attained in the worst condition. 
Finally, the peak regression residual of the DR curve is tested to be lower than 
6dB (1 bit) to insure the required integral non-linearity (INL). 

SNDR{Vin) and DR{Vin) curves for an architecture rejected by perfor- 
mance test algorithm are shown in Fig. 18. The dotted lines are the linear re- 
gression fitted values for both simulated curves. The drop in DR at high input 
levels (larger than -20dB) shows that the NTF aggressiveness needed to reach 
the target DR is too high [4]. The peak SNDR which still keeps a good overall 
INL is about 85dB instead of almost 95dB, as shown by its absolute peak value. 
Based on the fact that the decrease in peak SNDR already disqualifies this solu- 
tion, the simple yet effective criterion of peak regression residual limiting is used 
as rejection criterion. The slope of DR fitted line shows that the test for DR inter- 
cept works toward rejecting the solution as well, even if peak regression residual 
would be passing its test. The test for SNDR slope also works toward rejecting 




Figure 18: SNDR and DR curves for a rejected architecture 
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the solution. 



6.2.2. Estimation of Power Consumption 

Each integrator in the loop has its power consumption characterized by the Qm of 
the operational amplifier insuring the required integration linearity. To calculate 
the total Qm for a designed architecture-level solution, the passive components 
(resistors and capacitors) in each integrator have to be calculated from the noise 
budget and placed in the power models developed in previous Sections for SC 
and CT circuits respectively. 

The noise budget of each integrator is allocated based on evaluation of total 
power consumption as a function of noise budget distribution across the con- 
verter. A part R of the noise power of the previous integrator is allocated to the 
next in the loop 

{Pn)i — R {Pn)i-i (58) 

and the value of R is chosen to minimize total power consumption. Up to 25% 
reduction in current consumption by optimization of R has been observed on 
practical designs. 



6.2.3. Yield Analysis and Optimization 

From the accepted solutions, a top-ten set is chosen based on the ratio of peak 
SNDR and the power consumption in opamps measured by the total For each 
top-ten solution, a Monte-Carlo analysis varies the EA ADC coefficients using 
a normal distribution with the standard error supplied by the user. The user can 
also specify a spread 3a value along with the coefficient-to-coefficient mismatch 
3a value. The spread is used in the case of CT designs to simulate the effect 
of RC product spread over design performance. A few hundreds Monte-Carlo 
simulation steps are run for the top-ten solutions and the performance tests are 
applied. The single-loop solutions can be designed from the early stages to give 
100% yield with relaxed matching requirements by controlling the difference be- 
tween the quantization noise power and in-band white noise power. For cascaded 
solutions however, the yield can be corrected by increasing the number of bits in 
the first loop. This decreases the quantization noise power in the first loop and 
therefore the power of the mismatch-induced, low-order shaped noise leaking to 
the output. When the yield is lower than 90% the number of bits in the first loop 
in the cascade is increased and another Monte-Carlo yield analysis is started. 



6.3. Design Examples 

Two examples are presented to show the effectiveness of global optimization 
through exhaustive design-space exploration. The first one is an audio E A ADC 
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powered at 1.5V with rail-to-rail input and 1.5V reference voltage. A designed 
circuit has been reported [17] which consumes 0.95mW for a DR of 98dB and 
peak SNDR of 89dB at a signal bandwidth of 20kHz. The second one is an ADC 
for xDSL applications, powered at 2.5V, also with rail-to-rail input and reference 
voltage equal to the supply voltage. The signal bandwidth is 2MHz. A design 
has been reported which consumes 90mW in the analog circuits to attain 95dB 
DR and 90dB peak SNDR. 

The designs mentioned above are state-of-the art examples. The results pre- 
sented here show that other architectures can offer better peak SNDR versus sup- 
ply power consumption ratios (here used as figure of merit FOM) but these de- 
signs are still among the best options. 



6.3.1. Audio Sigma-Delta ADC 

The search for an optimal audio S A has been first performed in the entire design 
space, to find the global optimum. The global optimization results are shown in 
Table 1. It is worth noting the massive presence of cascaded solutions. The col- 
umn LOOPS shows each cascaded loop’s order in parentheses, while the column 
BITS contains the number of bits in the first loop in parentheses. The solutions 
have virtually the same figure-of-merit FOM. All solutions have OSR=32 and a 
large number of bits in the first loop, which increases the FOM value by increas- 
ing the overloading level with no additional power costs (in the power model used 
here). 



Table 1: Global solutions for Audio SA ADC 



ORDER 


LOOPS 


BITS 


OSR 


SNDR 


FOM 


4 


2 (2-2) 


2(6) 


32 


97.5 


119.8 


4 


3 (2-1-1) 


2(5) 


32 


97.3 


119.5 


5 


3 (2-1-2) 


1(7) 


32 


98.5 


120.2 


5 


2 (2-3) 


4(7) 


32 


98.5 


119.8 


5 


2(2-3) 


5(7) 


32 


98.1 


119.5 



Table 2 contains the optimization results for a set of solutions restricted by 
the number of loops (LOOPS=l). The best solution is the third order, 4-bit loop, 
again working at OSR=32. This solution is also remarkable by its low number of 
bits compared to the other top performers. 

Further design space restriction to 1-bit single-loop architectures yields only 
the state-of-the-art fourth-order, single-loop solution with OSR=64 reported in 
[17]. The solution is chosen from a set of four possible, three of which do not 
pass the yield test. 
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Table 2: Audio ADCs with L00PS=1 



ORDER 


BITS 


OSR 


SNDR 


FOM 


3 


6 


16 


95.3 


117.6 


3 


8 


16 


96.3 


117.8 


3 


4 


32 


95.5 


118.4 


4 


8 


8 


95.6 


116.4 


4 


5 


16 


96.0 


117.8 


5 


4 


16 


95.4 


116.5 


5 


6 


16 


96.2 


115.9 



6.3.2. Delta-Sigma ADC for xDSL Applications 

The results of global optimization for a 4MS/s SA ADC are shown in Table 3. 
Again a third-order solution with OSR=32 and 4-bits quantizer has good perfor- 
mance, but most of the solutions operate at 16 times oversampling. They also 
have large number of bits in the (first loop) quantizer to attain high overloading 
levels. 



Table 3: Global solutions for xDSL E A ADC 



ORDER 


LOOPS 


BITS 


OSR 


SNDR 


FOM 


3 


1 


4 


32 


96.5 


102.7 


4 


2 (2-2) 


4(8) 


16 


97.4 


103.1 


4 


3 (2-1-1) 


4(7) 


16 


98.1 


102.7 


5 


2 (2-3) 


3(8) 


16 


96.6 


102.1 


5 


2 (2-3) 


4(6) 


32 


97.1 


101.7 


6 


3 (2-2-2) 


2(8) 


16 


97.4 


102.6 



To avoid solutions like the ones requiring 32 times OSR, the search space 
is limited, by (ORDER=5, OSR=16). Furthermore, the maximal number of bits 
can be limited, for example to 6, to keep a low DAC complexity. The solu- 
tions for this search are shown in Table 4. They are all cascaded SA ADCs ex- 
cept two which, even with their low-FOM, can be good choices for low-voltage, 
mismatch-tolerant designs. The best are the three-loops with a 2-2-1 configu- 
ration. The state-of-the-art solution reported so in [15] is among them. During 
initial optimization stages, only 3 bits were needed in the first loop, but yield op- 
timization reached the 5-bit solution reported in [15], the increase being needed 
to accommodate capacitor/capacitor mismatch effects. 
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Table 4: xDSL ADCs with 0RDER=5, OSR=16 



LOOPS 


BITS 


SNDR 


FOM 


2 (4-1) 


2(5) 


93.1 


98.0 


3 (2-2-1) 


2(6) 


97.1 


102.6 


3 (2-2-1) 


3(5) 


95.7 


101.0 


3 (2-2-1) 


4(6) 


97.8 


102.5 


1 


5 


96.6 


100.4 


3 (2-2-1) 


5(5) 


98.1 


102.6 


1 


6 


96.5 


99.6 


3 (2-2-1) 


6(6) 


98.2 


101.7 
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Abstract 

A novel implementation for algorithmic and pipelined ADCs is 
presented in this paper. A floating voltage hold buffer is proposed 
which enables the accurate addition of signal voltages without 
requiring precisely matching and linear components. A new 1.5- 
bit stage is presented based on the floating hold buffer in which 
voltage multiplication is replaced by voltage addition. An experi- 
mental 12-bit 3.3 MS/s algorithmic ADC in 0.25//m standard 
CMOS for a 2V application is described. It occupies 0.15mm^ of 
die area and dissipates 5.5mW. The power and area FOMs are 
well below those previously reported for 1.5-bit algorithmic ADC 
stages. 
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1. Introduction 

There is an increasing trend to embed Analog-to-Digital Converters (ADCs) with 
the digital CMOS VLSI for applications requiring medium to high resolutions (10- 14- 
bits) at sample frequencies up to a few MHz. This reduces cost, board space and board 
complexity, pin count and overall power consumption. For instance, such ADCs have 
become a ubiquitous peripheral in micro-controllers for servo applications, touch 
screens, measurement of supplies and die temperatures, etc. 

The algorithmic or cyclic ADC [1],[2] is an excellent choice of architecture for 
applications where die area and power consumption are at a premium. The pipelined 
ADC [3], [4] is an excellent choice for applications demanding high speed. The core of 
both the algorithmic and the pipelined ADC is the highly efficient and robust 1.5-bit 
stage. It can be easily scaled in area and power down the pipeline for improved effi- 
ciency in pipelined converters [5]. The existing basic switched-capacitor (SC) charge- 
transfer circuit architecture for the implementation of the 1.5-bit ADC stage hasn’t 
changed much in recent years [4], [6], [7]. Instead much attention is being spent on 
improving calibration routines for the cancellation of the effects of capacitor mis- 
matches [8], [9], [18], [29]. Capacitor matching is difficult to control and usually limits 
the resolution of the 1.5-bit stage to below 10-bits in standard CMOS processes [14]. 

This paper proposes a novel circuit architecture for implementing the 1.5-bit ADC 
stage required for algorithmic and pipelined ADCs. The voltage multiplication opera- 
tions needed for such ADCs are implemented by replacing multiplication by addition. 
Simple techniques are presented for achieving high accuracy addition without using 
precision components. The resolution for a given power dissipation and sample rate 
can thus be significantly increased. 



1.1. ADC Efficiency 



Area and power figures of merit, i.e.FOM^^^^ and are useful mea- 

sures of the relative performance of ADCs because they compare objectively the effi- 
ciency of different design solutions. They are defined as: 






A 

2™x/, 



2 

nm /conversion/Hz 



( 1 ) 
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Power FOM (pJ/conversion) 

Fig. 1. Performance comparison of ADCs based on publications from 2000 onwards. 
and 

P 

POM — - pJ / conversion (2) 

^ ^ Js 

where ENOB is the effective number of bits and is the sample frequency. These 
FOMs normalize area and power into silicon area use and energy use per ADC con- 
version. They can be used to compare all types of converters irrespective of architec- 
ture, frequency of operation or type and generation of process used. There are three 
primary reasons for the continual improvement in efficiency of ADCs, namely 
shrinking technology size, improving design techniques, and novel design solutions. 
A comparison of recently published pipelined ADCs (year 2000 onwards) for similar 
technologies is shown in Fig. 1. The area and power FOMs of the current work - 12- 
bit algorithmic ADC - are also depicted in the same diagram. FOM and 
FOM of the corresponding pipelined ADC, when calculated based on standard 

stage-to-stage scaling i : i ; i - i : . . . 1 [5], are each improved by a factor of 4. 

Indeed, the sample frequency increases by a factor of 12 while the area and power 
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Fig. 2. Algorithmic ADC block(s) diagram 
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Fig. 3. 1 .5 -bit Algorithmic Block conversion stage. 



each increase by a factor of 3 (1), (2). This too is depicted in Fig. 1. It can be seen that 
this solution achieves the lowest area and power FOMs. 

1.2. The ADC Algorithm 

Any continuous analog voltage can be approximately represented in a n-bit 
binary form according to the following recursive algorithm [1]: 

f 1, Vi > 0 

Di^\ 

[-1, otherwise, 
i = l,2,...,n, 



( 3 ) 
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where ^ref determines the resolvable input signal range j 

and Di, / = 1,2, is mapped onto bi, i = 1,2, . . .,n , to give the binary representation 
of y^ : • Voltage / = 1, 2, in (3) is referred to as an analog resi- 

due voltage for the i -th iteration. 

An analog-to-digital converter based on (3) is called an algorithmic (or cyclic) 
ADC if the recursiveness of (3) is mapped on to a single piece of hardware, and a 
pipelined ADC if the recursive algorithm is mapped to a pipelined cascade of hard- 
ware stages. A practical algorithmic ADC employing a SC circuit was first presented 
in [2]. In this ADC, a charge transfer technique is used to implement the arithmetic in 
(3) and the ADC resolution is limited by capacitor mismatch. 

The charge transfer technique [2] is improved on in [6], with further circuit effi- 
ciencies presented in [10]. Digital error correction, DEC, for the removal of offsets in 
operational transconductance amplifiers (OTAs) and comparators, originally pro- 
posed in [11], has become an essential technique for modern algorithmic and pipe- 
lined ADCs [9],[12], including the current work. The operation of the 1.5-bit DEC is 
represented analytically by: 









I, 



0 , - 



v,> 



ref 



- 1 , 



ref 



ref 



< V, < 






ref 



4 ' 



i = 1, 2, . . . , n. 



(4) 



A block diagram of the algorithmic or pipelined ADC is shown in Fig. 2. The Clock 
Generator provides non-overlapping clocks and the Control Block allows the ADC to 
be configured for synchronous or asynchronous sampling. The 1.5-bit DEC block 
accumulates (4) to form a 12-bit output word. The purpose of the (single) algo- 
rithmic block in the case of the algorithmic converter, or the series of algorithmic 
blocks in the case of the pipelined converter is to perform the arithmetic of (4). 

Fig. 3 is a detailed representation of the Algorithmic block. It shows a 1.5-bit (two- 
comparator) flash ADC, a 1-bit flash ADC for finalizing the A-to-D conversion, a 
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1.5-bit DAC capable of generating ^ ^/H (sample-and-hold) and 

MX2 (multiply-by-2) blocks. Note that Fig. 3 is an alternative representation of (4) 
and is common to most algorithmic ADCs. The impact of the non-ideal building 
blocks in Fig. 3 on the performance of the algorithmic ADC is explained in [13]. 

To emphasize the hardware pertaining to existing implementations, the residue 
transfer function is rewritten as 

^out — 2 X — 1 X D • Vfgf (5) 

The multiplier factors (1 and 2) depend on capacitor ratios in existing charge transfer 
hardware realizations of this equation (eg. [14]). Charge is actively transferred from 
capacitor to capacitor via the virtual earth node of an OTA so that the accumulated 
charge on the feedback capacitor of the OTA produces (5). This method is lim- 
ited by the inaccuracies of capacitor matching as well as any non-linearity of capaci- 
tors. The DAC output voltages D* Vref e { + Vref’ ^an be produced 

relatively accurately since it is sufficient to switch polarities between -^Vref 
- or short to 0 in a differential realization. 

In the method proposed here, the MX2 stage is implemented without using multi- 
plication (i.e. charge transfer) but instead an accurate analog adder is employed. Fur- 
thermore, the DAC voltage is added without the need for charge transfer. In fact, 
formula (5) is rewritten as y^^^ in ~^^in ~ ^ ^ ref • will be presented next. 

2. Realization Concepts and Building Blocks 
2.1. Analog Addition 

In this section, a concept is presented for implementing the arithmetic operations 
(4) of the ADC [1] in analog hardware without using multiplication. Analog multipli- 
cation, within the context of a SC circuit, is a series of voltage-to-charge and charge- 
to-voltage conversions - known also as the charge transfer technique {QT). 

Typical concepts for creation of the MX2 (multiply-by-2) function are shown in 
Fig. 4(a), (c). Voltage multiplication by 2 occurs in Fig. 4(a) (C ^ 2C) with the 
charge transfer of Q = Vq -2C of capacitor = 2C to another capacitor C 2 = C, 
initially discharged, via a virtual earth node to become 2yQ. This method is sensitive 
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to the ratio of capacitors C 1 /C 2 . An adaptation of this method is shown in Fig. 4(c), 
which is called a charge transfer with flip-around (C — > C), where both C\ = C and 
C'l—C are pre-charged with Vq, after which the charge on is transferred to C 2 . A 
half a bit of extra accuracy can be achieved with the C C circuit compared to the 
C 2C circuit but in practical applications no better than 9- 10-bits accurate multi- 
plication-by-2 is realistic nor has been shown possible using these methods in stan- 
dard CMOS processes. Example implementations of these QT multiplication 
concepts are demonstrated in figures Fig. 4(b), (d), where the virtual earth node is cre- 
ated within the feedback loop of an OTA. The 1.5-bit DAC function is also created 
by multiplying the DAC voltage by a factor of 1 (Cj = C)/(C 2 = C) through charge 
transfer via the virtual earth node to the output capacitor C 2 . 



Charge Transfer 

qSvq 



Virtual Ground v= 2 V, 



~j^=2C ’ ^=2C ~j^ ( 

(a) 



Q=0 lQ=2CVo 



Charge Transfer 
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[Q=CVo 

f^=C 



Virtual Ground v=2V„ 



^=CVo 



o 



^=0 |Q=2CVn 

~j^=c ~|c^c 
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^CVq ®|q=cv, 
c^=c ^ 
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£=CV„ 
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V=Vo 
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(d) 
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Fig. 4. Methods for creation of MX2 function with (a) charge transfer C — > 2C, 
with a realisation (b); (c) charge transfer and flip around C C, 
with a realisation (d); 
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An alternative concept (C + C), which is new for creation of the MX2 function, is 
based on voltage addition. Capacitors can be used for addition, as demonstrated in 
Fig. 4(e), where instead of multiplying Vq by 2, voltage addition occurs by first 
charging each of Cj and C 2 to Vq and then placing these capacitors end to end, with 
say the plate (3) of C 2 connected to plate (2) of C^. Furthermore, to fulfill the 1.5-bit 
DAC function, the output can be easily level shifted by adding a voltage at plate (1) 
of Cl- 

When realized in a CMOS process, both terminals of C|, C 2 have parasitic cou- 
pling capacitance. This parasitic capacitance results in an inaccurate MX2 function. 
While the C + C concept of Fig. 4(e) is simple, its accurate implementation is diffi- 
cult - Fig. 4(f). In fact, direct implementation of the C -I- C concept is only suitable 
for low resolution ADCs of less than 8-bits. 



2.2. Floating Hold Buffer for Accurate Analog Addition 



In order to accurately implement the voltage addition concept of Fig. 4(e), a float- 
ing hold buffer is proposed with one input and one output, where the output voltage 
is held relatively constant and kept insensitive to the input voltage (i.e. of the form 

^out ~^hold 

Usually a single capacitor (C) is used to sample and hold a voltage - see, for exam- 
ple, Fig. 5(a),(b). Assume that C is used as a floating hold capacitor so that the only 
way for charge to escape from its top plate terminal is through the parasitic capacitor 
associated with the top plate - Fig. 5(c). The voltage across C in the hold mode 
is then given by 

yhoid=Vhoid,+VA-^^ ( 6 ) 




Fig. 5. (a) Sampling capacitor during sample phase, (b) sampling capacitor within hold circuitry, 
and (c) its connection to high impedance input of active element during hold phase. 
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Fig. 6. (a) Cl and C 2 used to sample a voltage, (b) hold phase, 

(c) proposed implementation using amplifier. 



where is the initial correct value of the hold voltage at the sample moment and 

is a floating voltage at node A. According to (6), V^oid depends on . This lim- 
its the quality of the floating voltage hold circuitry employing a single sampling 
capacitor C. 

Consider now two such capacitors used to sample ^C2 q " 

equations for the voltages across each of C^, C 2 in the hold mode, Fig. 6(b), are 



^C1 - ^CIq 



C 



pm 



^C2 -^C2o ■■ 



^ pm 
Co 



( 7 ) 



If we now set the condition in the hold mode that the voltages at the capacitor top 
plates are held equal, i.e. then the voltage held across both capacitors of 

Fig. 6(b) is given by 



Vci-Vc2-(Vcio-^C2o) + V^^ 



^ c c ^ 

par^ ^ pur2 



( 8 ) 



'2 J 



Defining the voltage held as - V ^2 , with = Veto “^C 2 o*e ini- 

tial correct hold voltage at the sample moment, equation (8) can be re-written as 



^hold - ^hold 



0 ^ 



'pm 



pm 



( 9 ) 
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. pciK\ DGKo par 

Since —p; p — ^ , then hold voltage (9) is significantly less sensitive 

C 1 ^2 ^ 

to when compared to equation (6) for a single capacitor implementation. Recall 
that =Vq, and the only path for the top plate charge to escape is through the par- 
asitics. This implies that nodes A and B must be physically disconnected. 

Fig. 6(c) shows the implementation of the floating hold circuitry employing C^, 
C 2 and a single-ended OTA. The OTA equalizes the voltages at the top plates of Cj 
and C 2 by means of the negative feedback loop through C^, thus satisfying 
of (9). drives the bottom plate of C 2 causing a corresponding change in node 
voltage B at the OTA positive input. The hold voltage defined across Cj and 
C 2 is insensitive to the voltage at the input terminal For each capacitor and 
C 2 , there is only parasitic charge displacement between the top plate and the parasitic 
coupling capacitance connected to the top plate at the OTA input side - this is similar 
to a delta-charge redistribution technique described in reference [15]. 

This high accuracy floating hold buffer operates such that the output is unity gain 
buffered from the input while maintaining a fixed pre-determined hold voltage 
between output and input. This is the fundamental building block needed for accurate 
voltage addition. 

2.3. Implementation of Single-ended and Differential MX2, L, DAC 

This section develops an application of the C + C concept to a 1.5-bit algorithmic 
ADC stage [30]. Recall that all the arithmetic required to carry out equation (4) can 
be done with analog voltage addition. Since the resolution of the algorithmic ADC is 
limited by the precision with which (4) can be implemented, the accurate floating 
buffer of section 2.2. is employed. 

An input voltage is sampled using two pairs of capacitors € 2 ^ and € 2 ^ - 
see Fig. 7. Each of these pairs of capacitors is used to create the feedback of a float- 
ing hold buffer of Fig. 6(c). A single-ended implementation of the 1.5-bit ADC stage 
is shown in Fig. 7(b). Two floating hold buffers connected in series double the sam- 
pled voltage. This function creates the MX2, while the toggle switch in Fig. 7(b) rep- 
resents the connection to the 1.5-bit DAC and hence this fulfills the implementation 
of the arithmetic of (4). Note that if each floating hold buffer has the same single-pole 
settling, then the settling time of both buffers in series increases by just one time con- 
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a) 




b) 




Fig. 7. Proposed C + C MX2, Summer and 1. 5-bit DAC level shift , 
(a) pre-charge phase, (b) single-ended, (c) differential. 



stant. For a 12-bit ADC, this means a 12% reduction in speed compared to a single 
buffer. 

The differential configuration - defined in [16] - assumes that any signal propa- 
gates through a pair of traces with identical impedance and coupling to surrounding 
components and signals. The differential configuration is preferred to the single- 
ended because, for example, of improved noise immunity and linearity. A differential 
implementation of the C + C concept is shown in Fig. 7(c). Note that in order to 
obtain a differential architecture, a series connection of floating hold buffers is 
replaced by a star connection. One floating buffer with € 2 ^ is used to obtain an 
accurate buffered version of the input, while the other floating buffer with € 2 ^ 
creates an accurate inversion so that the output voltage is double that of the input. 
The connection to the differential 1.5-bit DAC is again represented by the two toggle 
switches. This is then the differential implementation of (4). 

Another feature of the differential 1.5-bit ADC stage proposed in Fig. 7(c) is that it 
is inherently common-mode stable, unlike conventional differential ADCs employ- 
ing charge transfer techniques which require common-mode feedback to achieve sta- 
bility. Furthermore, because the sampling technique in Fig. 7(a) is well suited to both 
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differential and single-ended sampling, and because the circuits in Fig. 7(b), (c) are 
suitable for driving single-ended and differential lines, respectively, it is easy to build 
very accurate single-ended to differential and differential to single-ended conversion 
using the proposed design solutions. 

3. Practical Performance Issues 

The maximum resolution of a practical switched capacitor 1.5-bit ADC stage is 
limited primarily by capacitor mismatch and less so by finite amplifier gain and 
charge feedthrough from the switches. For the purpose of comparison of the com- 
monly used C — > C and proposed C + C 1.5-bit stages, similar capacitors are 
assumed to be used in both cases to sample the differential input signal. Fig. 8(a). 

Consider first the C ^ C circuit in hold mode. Fig. 8(b), with all relevant parasitic 
capacitances included. The transfer function, including capacitor mismatch, is 

■ 2 [l + ^ - D ■ V,,/ . [l + (10) 

where the relative signal capacitor mismatch is given by ^ ^^^ 2 ) ‘ 

Here, it is assumed AC C. Finite amplifier gain deteriorates the transfer function 
further giving rise to a total fractional transfer gain error of 




Fig. 8. Comparision ofC-^C and C+C circuit architectures including parasitic capacitors. 
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1 1 
2 C 



( 11 ) 



where the feedback factor Pc^C given by 



y^c-^c - 



C 

2C + 2C + C p^fi 



( 12 ) 



Due to the strong influence of capacitor mismatch on the C ^ C 1.5-bit stage, no 
better than 0.1% gain error can be expected so that some form of calibration or trim- 
ming must be used for an ADC accuracy of greater than 10-bits. Calibration gener- 
ally adds latency to the algorithmic ADC and increases area and power consumption 
not only as a result of the added calibration circuitry but because of the extra parasitic 
loading on analog circuit nodes and the consequent increase in amplifier power. 

The voltage transfer function of the C + C 1.5-bit stage is 



V. 



^c+c ^^«l^'^lC9+C 



C2 



par2 



c,+c 



pari 



DVr 



ref C9+C 



par2 



Ci+C 



pan 



;i3) 



With ^ 2 ' ^ 1^^ relative mismatch of the equiv- 

alent input parasitic capacitors ^ and C p^r^, equation (13) can be rewritten as 



V = 2V- 

^outc+c 



1 

2C 



ACr 



+ ^ I \-DV, 



ref 



1+-^ 



AC 



par 



' par 



+ ^)| (14) 



A small transfer error is produced proportional to times the mismatch of the 

signal and parasitic capacitors. Note that the extra term of AC p^^ jC p^j. must be 
accounted for when calculating the effect of mismatch. After inclusion of the effect 
of the amplifier, the total gain error for the C + C stage becomes 



^c+c 
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2 
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^ par 

c 



AC 

c ^ c 

par ^ 



1 

Pc+c 




2CMRR 



(15) 



with the feedback factor being given by: 



Pc+c - 



C 



^ pari2 ^ pari 



(16) 



For an N-bit converter having at least 0.5-bit accuracy, it is required that 
^C+C <’^ref 
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Standard OTA design techniques [17] allow the attainment of very high Aq and 
CMRR above 90dB which is sufficient for at least 15-bit accuracy. Indeed the DC 
CMRR can be designed to be of the same order of magnitude as Aq as explained in 
section 4. It is the capacitor matching that is the bottle neck to achieving higher than 
10 bits accuracy in charge transfer based 1.5-bit stage configurations, such as the 
C — > C method. Comparing formulae (11) and (15), the capacitor mismatch error of 

compared to the C — > C 
iven by AC/C 

par 1^ par ^ ^/sj^par ' Section 4., a CjCp^j. ratio of 35 has been obtained 
for this design. The matching of the parasitic capacitors (metal-metal and gate oxide) 
are expected to be at least as good as the signal capacitors (metal-metal only), so that 
a conservative estimate for the improvement in capacitor mismatch error of the 
C + C method compared to the C C method is C/2C^^^. This implies an extra 4 
bits accuracy (linearity) is possible over previous 1.5-bit conversion stages. Since the 
capacitors determine the efficiency of the ADC, it is possible to create a highly effi- 
cient architecture using the proposed C + C method giving rise to low power and sil- 
icon area. A further benefit is that the C -I- C architecture is relatively insensitive to 
capacitor non-linearity so that small area gate oxide capacitors could also be used. 

The feedback factor of the C + C stage is almost double that of the C — > C. The 
benefits of a larger feedback factor are to be found in improved amplifier gain, set- 
tling speed and noise. Switch charge feedthrough is largely cancelled at the input of 



l/Vc and 



the C + C method is reduced by xj ^ x ( 1 4- 
method. The relative capacitor mismatch errors are g 



S&H, Z, MX2 block in phase 1 1 .5b ADC S&^H, Z, MX2 blockjn phase 2 




Fig. 9. Implementation of C + C ADC 1.5-bit conversion stage. 
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each OTA due to the differential switching scheme and the appropriate use of early 
clocking ensures that the residue output of the C + C 1.5-bit stage is cleanly sampled 
before the switches connected around the OTA change phase. 



4. 1.5-bit Per Stage Circuit Implementation 



The merged implementation of the 1.5-bit conversion stage of Fig. 3, comprising 
the S&H, Z, MX2 and 1.5-bit DAC (implemented recursively in this work as an algo- 
rithmic converter) is demonstrated in Fig. 9 according to the method proposed in sec- 
tion 2. The ADC using the C + C technique is fully differential with the signal range 
given by = (RefP-RefN). Assuming initially that the DAC output is at RefCM, 
i.e. differential OV, then on a given cycle of the ADC, say phase 1, differential output 
voltage = Vq is present across the series combination of and in paral- 
lel with the series combination of and On the following cycle, phase 2, Vq 
is applied across the top OTA with between the top OTA output and its negative 
input and placed between the OTA positive input and the top output terminal of 
the DAC. Similarly, across capacitors and a further Vq is available 
between the bottom output of the DAC and the output of the bottom OTA. The net 
effect after phase 2 is that a voltage of = 2xVo is present between the output 
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Fig. 10. Current mirror amplifier with gain boosting. 
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Fig. 11. Dynamic comparator. 

terminals of the OTAs. If the DAC voltage changes to either RefP or RefN, then a 
voltage of either or is added to 

The 8 ADC sample capacitors are each 2pF and composed of inter-digitated metal- 
metal capacitors. Metal layers Ml, M2, M3 form the main active capacitance, being 
laid out in thin strips to create inter-digitated bottom and top plates. N-well, Poly and 
M4 are used for shielding. 

The circuit is designed to work from a 2V supply. Single-ended current mirror 
OTAs are used with gain-boosted NMOS cascoded output stage [31] - see Fig. 10 for 
simplified version. Both the output stage PMOS transistors and input stage PMOS 
current source are have the same output resistance. With g ^ the transconductance of 
the PMOS input stage and ^outpMos output resistance of either the output stage or 
input stage current sources, Aq and CMRR are each given approximately by 
S m ‘ ^outpj^os' simulated to be at least 70dB (sufficient for our 12-bit 

embedded application). The parasitic input capacitance of section 3. is given 
approximately by the gate-drain overlap capacitance of one transistor (36fF) of the 
input differential pair together with the wiring capacitance at its gate (20fF). Hence, 
CjlC is given by 2/2 x 0.056 ~ 17 which equates to about 4 bits improved accu- 
racy (linearity) over previous charge transfer approaches. Dynamic comparators were 
used - see Fig. 11 - with PMOS input stages to enable fast switching for low level 
input voltages. In this design, RefP and RefN were created as 1.4V and 0.4V. 

The circuit diagram of Fig. 12 shows the switches and clock phases used to switch 
the capacitors referred to in Fig. 9. The start of a 12-bit conversion, ADC cycle 1, is 
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determined by aclk, at which time the output of the T&H is sampled across and 
Qa in series and also and Qb in series. The T&H signal is also fed to the 1.5-bit 
sub- ADC to determine the MSB. On cycle 2, the T&H signal is multiplied by 2 and 
the DAC output is subtracted off. This is then sampled across and Qa in series 
and Cjb and C^^,. It is also fed to the 1.5-bit ADC to determine the MSB-1. For the 11 




ADC cycle: i 2 3 4 - 12 1 

Phase: - 1 2 1 - 1 



Fig. 12. Schematic ofS&H, Summer and MX2 block. 
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cycles following acquisition of the T&H signal, aclk is low and aclkn is high, so that 
the differential output of the OTAs is used as input for each cycle of the ADC. The 
circuit works in a double sampled mode with clocks clk\ and clkl. There is no 
latency, where 12 conversions of the ADC produce a series of 12 bits from MSB to 
LSB. 



5. Measurement Results 

The ADC was prototyped in standard 0.25/zm CMOS. A picture of the die showing 
the partitioning of the blocks can be seen in Fig. 13. Detailed measurements were car- 
ried out on 40 samples. The main performance parameters are presented in Table 1. 
The complete ADC with DEC, clocks and comparators occupies 0.1 5mm^. The total 
power consumption including digital circuitry is 5.5 mW. Typical DNL and INL 
plots are presented in Fig. 14. DNL of less than 0.25 LSBs and INL of less than 
O.SLSBs at 12-bit level is achieved. The THD and SNR as a function of the sampling 
frequency is represented in Fig. 15(a), while a typical FFT spectral density plot is 
shown in Fig. 15(b) for a 200kHz 1 Vpp input signal. The THD at IMS/s was mea- 
sured to be 77dB. The SNR was 64.5dB which is lower than expected. This was 
attributed to the layout where there was coupling from the digital output buffers back 
into the analog supply. It is expected that a new version of the ADC, presently in dif- 
fusion, will alleviate this problem and have improved conversion efficiency. 




Fig. 13. Die photograph showing partitioning of blocks. 
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6. Conclusions 

A novel implementation of the 1.5-bit ADC stage for use in both algorithmic and 
pipelined ADCs has been presented. Voltage multiplication has been replaced by 
accurate addition and a floating hold buffer has been proposed for its implementation. 
The accuracy of the ADC employing the new circuitry is less sensitive to the match- 
ing accuracy and linearity of the sampling capacitors compared to previous circuit 
techniques presented in the literature. 

An analytical equation has been obtained of the transfer error of the proposed algo- 
rithmic stage and it is compared with the analytical equation of the traditional charge- 
transfer algorithmic stage. With C the value of the sampling capacitors and the 
parasitic capacitance at each differential amplifier input terminal, the proposed algo- 
rithmic stage has its sensitivity to capacitor mismatch reduced by a factor of 
Cj2C giving a corresponding improvement in linearity. Ensuring that C p^r C 
suggests a high resolution ADC stage (12-bits and beyond) can be practically realized 
without the need for calibration. 

Within the die area allocated to this work for an embedded application, 12-bit 
accuracy has been achieved at 3.3 MS/s using uncharacterized metal-metal capacitors 
and neither trimming nor calibration was needed. The ADC is suitable for embedding 
in CMOS digital VLSI and it will scale easily with the process without the need for 
special analog characterization. 

Table 1: Measurements 



Technology 


250nm standard industrial CMOS 


Resolution 


12 bits 


Conversion Rate 


3.3 MS/s with 40 MHz clock 


Clock cycles for 12 bits 


12 


Active area 


0.15 mm^ 


Power 


5.5 mW 


DNLat 12b 


< 0.25 LSBs 


INLat 12b 


< 0.8 LSBs 


THD at 1 MSamples/s 


77 dB 


SNR at 1 MSamples/s 


64.5dB 


SFDR at 1 MSamples/s 


80dB 


Power FOM 


1 .2 pJ/conversion 


Area FOM 


31 nm2 /conversion 
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Abstract 

Modem sensing applications often require high resolution 
conversion without compromising power consumption. In this 
Chapter, the maximum out-of-band quantization noise in delta- 
sigma converters is shown to play a key role in optimizing the 
performance of low power and high resolution delta-sigma 
analog-to-digital converters (ADCs). Two design examples will 
be highlighted that illustrate the use of these techniques to achieve 
high resolution conversion with low power consumption in deep 
submicron CMOS processes. 



1. Introduction 

Deep submicron CMOS processes often dictate different design styles and 
design approaches - particularly for analog circuit design [1,2]. As devices 
shrink in dimension, the tolerable signal swings reduce significantly. While for 
digital circuits this results in improved speed, transistor scaling can result in 
performance reductions for analog circuits. For example, to achieve the same 
ADC resolution when the signal swings are reduced, the noise floor must be 
reduced. This requires squaring the input capacitance when the voltage swing is 
halved for analog circuits limited by the kT/C noise. As a result, the power 
consumption is typically increased to achieve the same speed and resolution as 
the previous process generation. 

Delta-sigma A/D converters are particularly interesting for a wide range of 
applications because they can be customized using digital signal processing. 
The also achieve high resolution with less stringent requirements on device 
matching, but typically at the expense of high oversampling ratios and 
therefore, lower bandwidths. This chapter will illustrate a critical parameter in 
the design of delta-sigma ADC converters that allows extending the 
performance of the converters to wideband applications while achieving high 
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resolution. The basic background will first be given followed by two examples 
of recently fabricated converters. 

2. A High Resolution Delta-sigma ADC with High OSR 

The noise transfer function (NTF) of a delta-sigma modulator determines, in 
part, the overall resolution and the stability. A typical power spectral density of 
the quantization noise is shown in Fig. 1. The baseband boundary represents 
the input signal range for the converter and it is desirable to minimize the 
quantization noise in this band. The quantization noise amplitude increases for 
high frequencies and reaches a maximum at what is referred to as the maximum 
out-of-band quantization noise gain (Qmax). Prior work in delta-sigma ADCs 
supported limiting Qmax to ensure stable operation [5]. Further investigation 
has revealed that by optimizing this value, it is possible to increase the signal- 
to-noise-plus-distortion ratio of delta-sigma ADCs [6]. 




sigma modulator. 

For higher order A/D conversion, there are a number of high order delta-sigma 
architectures that are employed. One common implementation of a higher order 
delta-sigma modulator is the leapfrog architecture shown in Fig. 2. The 
interlocked feedback paths for the placement of zeros offer sufficient 
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insensitivity to the coefficient variations while providing higher-order loops for 
reduced signal-band tonal behavior. The complex transmission zeros in the 
quantization noise shaping characteristic are determined by the resonator 
feedback coefficients indicated by the b ’s in the figure. The quantization noise 
shaping characteristics and the stability properties are determined by both the a 
and b coefficients. 




To determine the maximum SNR that can be achieved, a fourth-order delta- 
sigma modulator is simulated as a function of the maximum out of band 
quantization noise gain. As shown in Fig. 3, the maximum SNR is achieved for 
a Qmax value of approximately 1.7. Beyond this region, the modulator is 
unstable. 




Fig. 3. Maximum SNR versus the maximum out of band quantization noise gain 

for a fourth-order modulator. 
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To optimize the design so that minimum power consumption is achieved, the 
integrators are designed to slew thus there are significantly less time constants 
required for settling. To illustrate the performance that can be achieved with 
this approach, two fourth-order delta-sigma ADCs are fabricated with the 
leapfrog architecture. A die microphotograph of two fourth-order delta-sigma 
ADCs is shown in Fig. 4 [8]. Both modulators are identical with the exception 
that one includes correlated double sampling at the input and the other does not. 
They are fabricated in a 0.6|Lim CMOS process operated from a single 3.0V 
power supply. The architecture as in Fig. 2 is used for this design and the Qmax 
is optimized to realize high resolution even with a low 3V CMOS process is 
employed. The reference signal is l.OV and the maximum input signal voltage 
is 2.V single ended, peak-to-peak. The total power consumption is 22.8mW and 
the design consumes an area of 8.7mm x 2.8mm. 




Fig. 4. Die microphotograph of two delta-sigma ADCs implemented in a 0.6 jJm 

3 V CMOS process. 

The measured output spectrum from the two designs is shown in Fig. 5. The 
lower measurement is the design including correlated double sampling it shows 
a nearly 20dB lower noise floor. In this design, the noise floor is limited by 
KT/C noise while in the non-CDS design, the noise floor is limited by the 1/f 
noise. The maximum out-of-band quantization noise is shown to approximately 
the same for the two designs. 

The measured SNR for the two designs is shown in Fig. 6. The modulators are 
clocked at 1.024MHz and a 50Hz input signal is applied. The results are plotted 
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for a bandwidth of lOOHz. The CDS modulator achieves an SNR of 22.4 bits in 
a lOHz bandwidth and 19 bits in a IkHz bandwidth. 




tliermal noise limited 

Fig. 5. Measured power spectral density of the delta-sigma modulators. 




Fig. 6 Measured SNR versus input signal level for the two delta-sigma ADCs. 

3. 14-bit Multi-path Delta-sigma Fabricated in a 0.18pm CMOS Process 

The leapfrog architecture is suitable for delta-sigma modulator designs that use 
moderate to high oversampling ratios. However, for low oversampling ratios, it 
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limits the achievable SNDR as illustrated in Fig. 7. In this figure, the lower 
limit on the maximum out-of-band quantization noise gain is shown by the solid 
lines for various orders of delta-sigma modulators. Below this range, the 
modulators will produce tonal behavior. At the lower part of the graph, the 
value of the maximum out-of-band quantization noise gain is shown for a stable 
S^-order leapfrog modulator. Notice that for very low oversampling ratios, 
there is little or no room for developing a delta-sigma modulator that falls 
between these two ranges. However, at very high oversampling ratios (much 
greater than 20), this architecture is well suited to achieving high resolution 
with negligible distortion. 

To determine the optimal maximum out of band quantization noise gain, the 
SNDR is simulated for 3^**, 5*, and 7* order modulators. The maximum SNDR 
is shown when the maximum out of band quantization noise gain is increased 
from 1 to 7. Notice that for a 5* order modulator, the SNDR can be increased 
by more than 25dB when Qmax is increased from 1 to 5. The primary 
challenge is to determine an architecture that can be implemented with higher 
Qmax values that does not have a high component spread. 




Oversampling Ratio 

Fig. 7. Maximum stable range of a 5’^ -order leapfrog and hybrid delta-sigma 

modulator 




357 




Fig. 8. Plot of SNDR versus the maximum out of band quantization noise gain. 



The upper dashed lined in Fig. 7 indicates another architecture that is suitable 
for very low oversampling while achieving much higher SNDR. This 
architecture is referred to as a mult-path modulator and is shown in Fig. 9. This 
architecture consists of second-order HR sections, shown in Fig. 10, embedded 
within the modulator. By embedding the second-order sections rather than 
cascading them, the requirements on the circuitry are relaxed. For example, in 
MASH architectures very high op amp gains are required, however, in this 
architecture, very low op amp gains are required to achieve high resolution. In 
the example below, it is shown that only 40dB of DC gain is required for a 14- 
bit ADC as compared to typically 80-90dB in MASH architectures. The 
architecture shown in Fig. 9 also includes a 4-bit ADC and DAC. Data 
weighted averaging is employed to attenuate the errors due to mismatches in the 
DAC. 

An additional feature of this architecture is that there is a simple relationship 
between the noise transfer function pole/zero locations and the modulator 
coefficients. This is primarily due to the multipath nature of the architecture 
where each biquad is decoupled and not a complex high order set of equations. 
Another benefit of this structure is the coefficient spread is small. In the case of 
a 5^^ order modulator with a 4-bit quantizer with 8X OSR, the coefficient spread 
is only 8 compared to more than 200 with a leapfrog architecture. 
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Fig. 9. Hybrid delta-sigma modulator architecture. 

IN 




Fig. 10. HR structure in Fig. 8. 

To illustrate the applicability of this approach for low oversampling, a 14-bit 
converter with 8X oversampling is also shown. The die microphotograph is 
shown in Fig. 1 1 where the chip consumes 2.8mm^. The modulator architecture 
is shown in Fig. 9 and it consists of a 17-level quantizer. The maximum out-of- 
band quantization noise gain of 6 is chosen to maximize the SNDR. The opamp 
DC gain is designed to be 43dB. This is particularly important since a 0.18|iim 
CMOS process is used where it is difficult to achieve high opamp DC gains. 

The converter achieves 83.9dB SNDR at 8X oversampling ratio and the 
measured results are shown in Fig. 12. The maximum input signal bandwidth is 
2MHz for this implementation. Measurements also illustrate the performance 
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degradation when the data weighted averaging algorithm is not used for large 
input signals. 



Fig. 11. Die microphotograph of the 14-bit delta-sigma ADC. 
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Fig. 12. SNDR and SFDR as a function of the input power for the 14-bit 

converter. 
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4. Conclusion 

The importance of the maximum out-of-band quantization noise gain is stressed 
to achieve high resolution and/or high speed delta-sigma analog-to-digital 
conversion. Two design examples are given to illustrate the application of this 
approach. Both achieve very good performance with relaxed circuit 
requirements. 
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Abstract 

Moving into ultra-deep-submicron CMOS technologies, the de- 
sign of S-A ADCs faces more challenges. Among them impor- 
tant ones are the restriction from the decreased supply voltage, the 
higher distortion and degraded device characteristics. The reduced 
supply voltage decreases the signal swing and lower noise floor is 
required to maintain the same signal-to-noise ratio. On the other 
hand, some advantages are gained. Important ones are the reduced 
threshold voltage of the transistor and lower power consumption 
of the digital circuits. In this paper, low-power low- voltage E- 
A ADC design strategies are introduced and design techniques in 
ultra-deep-submicron CMOS technologies are presented. As an 
example, a 1 V, 20 kHz, 88 dB, 140 /xW E-A modulator design in 
a 90 nm CMOS technology is presented. 

1. Introduction 

Driving by the speed demand of digital circuits, in recent years the deep-submicron 
CMOS technology is pacing into 100 nm range, i.e. ultra-deep-submicron. As 
a consequence, the power supply voltage drops below one Volt earlier than ex- 
pected. On the other hand, portable electronics with low-voltage operations find 
their big market. As a major building block, the analog to digital converter (ADC) 
is widely used in mixed-signal designs. As an interface between the analog world 
and the digital world, ADCs are implemented with the same technology as the dig- 
ital circuit hence the power supply voltage reduction is inevitable. The decrease of 
the supply voltage obviously results in analog circuits performance degradations. 

Among different ADCs, the E-A ADC is most suitable for high-resolution 
applications due to its high-linearity feature. The high linearity is gained by using 
the intrinsic linear 1-bit quantizer and the oversampling technique. However, the 
non-idealities in the building blocks strongly affect the ADC performances. To 
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Table 1: The SI A roadmap 2003 edition 



Year 


2003 


2004 


2005 


2006 


2007 


2008 


2009 


Feature size(nm) 


100 


90 


80 


70 


65 


57 


50 


Supply Voltage(V) 


1.2 


1.2 


1.1 




1.1 


0.9 


0.9 



design a S-A ADC, main efforts are put on fighting with the non-idealities of 
building blocks. The reduced supply voltage makes the situation even worse. 

In this paper, design difficulties of moving into ultra-deep- submicron technolo- 
gies are addressed first. Then several low-power low- voltage E-A ADC design 
strategies and special designed circuits in ultra-deep-submicron technologies are 
introduced. As an example, a low-power low- voltage E-A modulator in a 90 nm 
CMOS technology is presented. Finally several conclusions on the design of low- 
power low- voltage E- A modulators in ultra-deep- submicron CMOS technologies 
are drawn. 



2. Moving into Ultra-Deep-Submicron CMOS 

Table 1 shows the International Technology Roadmap for Semiconductors 2003 
edition from the Semiconductor Industry Association(SIA) [1]. From 2004, the 
sub- 100 nm CMOS technologies will be in the mass production stage. The de- 
velopment is driven by the speed demand of the digital circuit. However, for the 
analog design, the movement into ultra-deep-submicron technologies brings some 
design difficulties. The impacts of moving into ultra-deep- submicron technologies 
on the E-A ADC design are analyzed as follows. 

2.1. Decreased Signal Swing 

The signal swing defines the dynamic range for a system with a certain noise floor. 
In order to increase the dynamic range of a system, the signal swing should be 
maximized and the noise floor should be minimized. Unfortunately, the movement 
to ultra-deep- submicron technologies brings the restriction of the supply voltage, 
which leads to a small signal swing of the system. For a thermal-noise-limited 
E-A ADC, the dynamic range can be expressed as: 

DR = 10 log^ = 10 (1) 
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And the maximum input signal amplitude, Vinmax^ can be expressed as: 

^inmax ~ Vref • OL (2) 

Where is the input referred noise power, Vref and OL are the reference volt- 
age and the overload level, respectively. The overload level is the normalized input 
amplitude while the signal-to-noise ratio (SNR) decreases from the peak SNR by 
3 dB. For a certain E-A ADC topology, the overload level is fixed. One can only 
increase the reference voltage to increase the maximum input signal amplitude. 
But in the real circuit the reference voltage is limited by the output swing of the 
integrators, which is limited by the output swing of the operational transconduc- 
tance amplifier(OTA), and finally limited by the supply voltage. 

2.2. Degraded Transistor Characteristics 

Due to the channel length modulation, the drain-source resistance, Rds^ of a MOS 
transistor, decreases along with the reduction of the channel length. Fig. 1 gives 
the transistor Id — Vds characteristics of transistors with different channel lengths 
from the same ultra-deep-submicron CMOS technology. The intrinsic gain, de- 




(a) Transistor — Vos characteristics 
with W/L=8 /xmyO.8 /xm. 




(b) Transistor lo — Vos characteristics 
with W/L=0.8 iimJO.OS /xm. 



Figure 1: Transistor characteristics with different channel length in an ultra-deep- 
submicron CMOS technology. 



fined in (3), is the amplification ability of a single transistor. 

. 8m 2 • Id 1 2 

intrinsic " “ “ * 7^ “ ~ Fr)X 



( 3 ) 
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Figure 2: The transistor intrinsic gain and fj V5. channel length curves with Vgs~ 
Vt = 0.2 V and Vds = 0.3 V in an ultra-deep-submicron CMOS technology. 



The intrinsic gain of a transistor decreases along with the transistor channel length 
reduction. Fig. 2 shows the transistor intrinsic gain and fj curves vs. the transis- 
tor channel length. It can be seen that if the transistor channel length is increased, 
then the fj decreases rapidly. The gain-speed trade-off must be made here. 

Another problem with ultra-deep-submicron technologies is that the transistor 
channel noise increases while the channel length is shrunken [2]. Further more, 
the 1 // noise comer frequency is higher in ultra-deep-submicron technologies, 
making the design of high dynamic range S-A modulators more difficult. 

2.3. Distortion 

The main source of the distortion on the E-A modulator is the OTA inside the 
loop. In the low- voltage environment, transistors are forced to work at the rela- 
tively larger signal condition, especially for output devices inside the OTA. From 
Fig. 1 it is seen for ultra-deep-submicron technologies, the drain-source resis- 
tance, 2)5 of a transistor, has strong dependence on the source-drain voltage. 
The drastically changed source-drain voltage results in a big variation on the out- 
put conductance of the transistor, and generates more distortion. 
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Another source of the distortion is the input voltage dependent on-resistance 
of the switch. The main effort to solve this problem is making a constant on- 
resistance of the switch, which means constant overdrive voltage for switch tran- 
sistors, In the low-voltage environment, normally clock boosting circuits are 
needed to do so [3]. 

3. Low- Voltage Low-Power S-A Modulator Design 

The E-A modulator is always combined with digital circuits to form a system. 
As a result, the S-A modulator and digital circuits are fabricated on the same 
chip and work under the same low supply voltage. Different from the classic E- A 
modulator design, the design is in the low-voltage environment and some design 
strategies should be employed here. At the same time, the low-power consumption 
is normally addressed. 



3.1. Low- Voltage Strategy 

For a low- voltage E-A modulator, the signal swing is vital. The rail-to-rail out- 
put ability is essential for building blocks. The main building block in a E-A 
modulator is the OTA. OTA topologies that can offer rail-to-rail output swings are 
preferred. This demand is not only required by the signal swing consideration, but 
also by the distortion consideration. Shown in Fig. 3, the common-source output 
stage is often chosen for its rail-to-rail output ability. However, the insufficient 
gain of the single common-source stage forces designers turning into multi-stage 
OTA topologies. For a switched-capacitor E-A modulator, the DC gain of the 
OTA defines the accuracy while the slew-rate and settling time define the speed 
of the modulator. In the low-voltage environment, these requirements are more 
difficult to reach at the same time. Trade-offs are usually made to obtain some 
important parameters. 

The common-mode input and output voltage may be different in the low- voltage 
environment. To obtain the highest output swing, the output common-mode volt- 
age is set to the middle of the supply voltage. However, if the input common-mode 
voltage is also set to the middle of the supply voltage, then the rail-to-rail input ca- 
pability, which can’t be offered by the normal differential input stage, is required 
for the OTA. If the input common-mode voltage is set near either one supply rail, 
then the normal OTA can be used without the rail-to-rail input requirement and 
simplifies the circuits. 

For the modulator topology, the single loop topology is usually selected in low- 
voltage designs for its relaxed requirements for building blocks. And the single-bit 
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Figure 3: Output swing comparison of cascode and common-source stage. 



quantizer is preferred in low- voltage S-A modulators, as it is easy to implement 
and the requirements, such as the offset and hysteresis, are quite relaxed too. The 
low- voltage multi-bit quantizer requires the rail-to-rail input ability for compara- 
tors and the offset requirement is more stringent than the single-bit one’s [4]. 

The best strategy for low-voltage S- A modulator is to combine the circuit and 
system-level design techniques together. An example is the low-distortion E-A 
modulator topology [5]. In this topology the signal swing inside the loop can 
be made much smaller than that of the classic topology, which is most preferred 
in low-voltage E-A designs. This unique feature makes this topology a good 
candidate for low- voltage low-power E-A modulator designs. 

3.2. Low-Power Strategy 

The first and best low-power strategy is on the system level. To choose a better 
system topology that consumes less power is the most effective way to reduce the 
power consumption. For a E-A modulator with given specifications, the mod- 
ulator topology, sampling frequency and oversampling ratio are essential for the 
system power consumption. 

As described before, the reduction of the power supply voltage limits the signal 
swing. For the same dynamic range, the noise floor, or the mismatch error of a 
system, should be reduced as well. The noise floor, often restricted by the KT/C 
noise in the switched-capacitor circuit, can only be suppressed by increasing the 
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capacitance value. From (1), the dynamic range of a S-A ADC can be expressed 
as: ^ ^ 

DR = 10 logy = 10 — = 10 log — (4) 

Where OSR and Q is the oversampling ratio and sampling capacitance in the first 
integrator, respectively. From (4) it is clear that for a certain dynamic range, while 
signal swing is reduced, the oversampling ratio or the first sampling capacitance 
must be increased. On the other hand, the mismatch error, which is reversely 
proportional to the square root of the area of the device, can only be suppressed by 
increasing the size of the device. Increasing both sizes will need higher current to 
maintain the same speed, which means higher power consumption. In the system 
point of view, the larger signal swing leads to the lower power consumption. So 
the higher signal swing is also an important low-power strategy. 

The building block topology is also important for low-power design. In E-A 
modulators, the OTA consumes the most power, especially the one in the first in- 
tegrator. Among all OTA topologies, the single-stage topology is the most power- 
efficient one as it is load-compensated and no power is wasted in the compensa- 
tion. Since in low-voltage E-A modulators, OTAs work in relatively large signal 
condition and large capacitive loads are always present, so the load-compensated, 
class AB OTA is preferred in terms of low-power consumption [6]. 

One of the most interesting property of E-A modulators is the noise suppres- 
sion inside the loop. Utilizing this feature can result a large amount of power 
saving. For a single-loop Sigma-Delta modulator, the noise suppression in the 
node k can be calculated by 

OSR^^ ^ 

Fsup,k = ~^0.k + l)(n«/)^ (5) 

^ i=0 

Where F is the noise suppression factor, OSR is the oversampling ratio and Ui 
denotes the loop coefficient of the i-th stage. This property allows the sampling 
capacitances of following stages being scaled down proportionally to correspond- 
ing ratios. The only restriction is from the matching requirements. Reducing the 
sampling capacitance results the reduction of the load capacitance of the OTA and 
hence reduce the power consumption. 

For a single transistor, the ratio between the transconductance and the drain 
current, gmlln. describes the power efficiency of this single device. Fig. 4 shows 
the gml^D and fj as a function of the transistor overdrive voltage of a transistor 
in an ultra-deep submicron CMOS technology. In the sub-threshold region the 
transistor achieves the highest power efficiency. This region is called the weak 
inversion region. However, the transistor biased in this region is slow, as a large 
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Figure 4: The transistor gm/ln ratio and fy vs. Vgs ~ Vj curves in an ultra- 
deep-submicron CMOS technology. 



W/L is required. A compromise between the power efficiency and speed is made 
here. For ordinary applications, the weak to moderation inversion region (Vgs — 
Vt ^0.1 -- 0.2 V) is preferred, where both good power efficiency and speed 
performance can be achieved. 

4. Design of a Single-Loop Third-Order E-A Modulator in 90 nm CMOS 

To explore the possibility of designing high-performance S-A ADCs in ultra- 
deep-submicron CMOS technologies and verify the design strategies described 
before, a single-loop third-order S-A modulator was designed in a 90 nm pure 
digital CMOS technology. The technology features 9 copper layers of intercon- 
nect and low-K insulation. But no metal-insulator-metal (MIM) capacitor is avail- 
able in this technology. The goal is to design a high-resolution, audio bandwidth 
E-A modulator with minimum power consumption in this latest technology. 





369 



4.1. Topology Selection 

As mentioned before, under 1 V power supply, the non-idealities of the building 
block is much severe. So single-loop S- A topology, which is not sensitive to these 




Figure 5: Single-loop third-order S-A modulator topology. 



non-idealities, is chosen to implement this design. In this pure digital technology, 
no MIM capacitor is available. All capacitances should be implemented by means 
of the metal-metal capacitance or fringe capacitance. The matching properties 
of these kinds of capacitances are worse than that of the MIM capacitance. So 
the loop-coefficients are chosen not so aggressive comparing to [7] to tolerate 
matching errors. Fig. 6(a) shows a behavioral simulation result of the output 
spectrum of the modulator. More than 85 dB of the signal to noise ratio (SNR) 
has achieved by using the proposed topology. Fig. 6(b) shows the SNR of the 
converter vs. the OTA DC gain. The minimum DC gain requirement of the OTA 
is 30 dB to achieve a 85 dB SNR for the converter. 

Fig. 7 shows the output swing of each integrators normalized to the reference 
voltage while the input amplitude is 70% of the reference voltage. The main con- 
sideration in the reference voltage is the distortion. The higher reference voltage 
means the higher dynamic range and higher output swings of OTAs. However, the 
high output swing drives the output transistors in the OTA out of the saturation 
region. As a result, the transistor’s output impedance and the voltage gain are 
decreased drastically. This non-linear operation generates distortion. The lower 
reference voltage is, the less distortion is generated. As a compromise between 
the dynamic range and distortion, the reference voltage was selected to be 0.6 V, 
which can ensure all the output transistors work in the saturation region with some 
safe margins. 
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(a) Output spectrum of the proposed 
topology with a 20 kHz input signal. 



100 

95 

90 




OTA gain [dB] 

(b) SNR vs. OTA DC gain for the pro- 
posed 3rd-order single-loop Sigma-Delta 
modulator. 



Figure 6: Performance of the proposed topology. 



From (4) the required first sampling capacitance value can be calculated. The 
first sampling capacitor is set to 6 pF. The second and third sampling capacitor can 
be scaled down. According to (5), for the proposed topology, the noise suppres- 
sion of the first, seconded and third stage is 28 dB, 55 dB and 82 dB, respectively. 
Considering the matching property of the capacitors, both the second and third 
sampling capacitor are set to 0.4 pF. Accordingly the OTAs can also be scaled- 
down to save the power. 



4.2. Building Block Design 

As described previously, the rail-to-rail output swing is highly preferred for the 
OTA, and load-compensated OTA is preferred due to its high power-efficiency. 
The current mirror OTA is a good candidate for this task. However, the gain of 
the current mirror OTA is only a gain of a single stage amplifier. In ultra-deep- 
submicron technologies, the single stage gain is in the order of 20-30 dB ranges, 
which is not enough for the proposed topology. A gain enhancement technique is 
used in this design [8], and the DC gain has been enhanced to around 50 dB to 
fulfill the requirement with some safe margin. This OTA is load-compensated and 
with a class AB output stage, which both are good to lower the power consump- 
tion. 

Shown in Fig. 8, the OTA is a simple current mirror OTA with two shunting 
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Figure 7: Normalized output of each integrators of the proposed topology. 



VDDA 




Figure 8: The current mirror OTA with gain enhancement. 
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current sources. The DC gain of a current mirror OTA can be described as: 



Ao = gmi ■ B ro3 



2/pi B 
VgSI — Vt Id3 ■ ^3 



2 

(Vg5i — Vt))^3 



( 6 ) 



Where A 3 is the channel length modulation coefficient of the transistor M3 and 
Id3 = B ■ Idi- From ( 6 ) it can be seen that the gain of the current mirror OTA 
is still in the order of an intrinsic gain of a single transistor. For ultra-deep- 
submicron technologies, the gain is merely around 30 dB. With the current source 
shunting part of the current from the diode-connected transistor M2, the enhanced 
gain now is: 



Aen - gmi ■ B ro3 



2/pi B 
Vg51 — Vj Id3 ■ A 3 



(7) 



Where now Id 3 = {I — k) - B ■ so: 



1 2 ^ Ao 

l-k ' (Fg 51 - Vr)A3 1 - k 



( 8 ) 



From ( 8 ), it can be seen that the OTA gain is enhanced times. The gain 
enhancement can be adjusted by changing the k factor. In practice, the gain en- 
hancement is restricted by several factors. The first is the restriction of the OTA 
frequency response. Doing that actually decreases the non-dominant pole fre- 
quency and the phase margin. The criterion is to keep a safe phase margin. As 
the dominant pole is created by the load capacitance, so the higher load capac- 
itance the OTA drives, the higher k factor can be chosen. Secondly, in practice 
the matching of current sources determines the total gain enhancement. To ensure 
good matching between the tail current source and the shunting current source, 
both of them are mirrored from the same reference current source. Due to the 
matching error, the k factor can not be set too high. Practically the k factor can 
be set to 0. 8-0.9, which can enhance the OTA gain by 10-20 dB. The overall OTA 
DC gain can reach 40-50 dB by using this technique and the requirement of the 
E-A modulator can be fulfilled. 

Another low-power consideration is the class AB operation of the OTA out- 
put stage. Shown in Fig. 9(a), the class AB operation is made by driving the 
PMOS transistor M4 by a floating voltage source Vbias- This is a very crude class 
AB output scheme that the class AB characteristics are not perfect. But here in 
switched-capacitor circuits, the high slew rate is more important than the good 
class AB output characteristics. As long as the output settles to the required fi- 
nal value within one clock period, the settling procedure is not important here. 
Actually this class AB stage increases the current sourcing ability of the output 
stage, and then effectively increases the slew rate of the OTA. The floating voltage 
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(a) Class AB biasing of the output PMOS 
transistor. 
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Figure 9: The class AB operation of the output stage. 



source V^ias is implemented by a level-shifter, shown in Fig. 9(b). The transistor 
M5 and M6 is identical and have the same drain current. As a result, their gate- 
source voltages are identical too. By properly biasing the M5, a certain Vbias can 
be obtained. 

By using the techniques described above, a current mirror OTA with gain en- 
hancement was designed. Fig. 10 shows the simulated frequency response of the 
OTA using the 90 nm CMOS technology. The simulation was done under a 1 V 
supply voltage and 6 pF load capacitance. The k factor was set to 0.9. The DC 
gain of the OTA reaches 50 dB and the phase margin is 60 degrees with a GBW 
of 15 MHz. The total power consumption of the OTA is 80 /xW. 

The conmion-mode feedback (CMFB) circuits are implemented with a switched- 
capacitor (SC) circuit [9]. The advantage of the SC CMFB circuits is the lower 
power consumption and it is easy to make the common-mode bandwidth of the 
OTA higher than the differential-mode bandwidth. 

Another practical low-power consideration is the feedback scheme of the in- 
tegrator. Fig. 1 1 shows two different feedback schemes. The left one directly 
connects the feedback signal to one terminal of the sampling capacitor during the 
clock period C2, i.e. integration phase of the integrator. The right one uses two 
sampling capacitors to sample the input signal and feedback signal, respectively. 
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Frequency (Hz) 

Figure 10: Simulated frequency response of the proposed current mirror OTA with 
gain enhancement. 




Vrefn Vrefp Vrefn Vrefp CMi 



Figure 11: Different feedback schemes of the E-A modulator. 
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and then summarize these two signals during the integration phase of the inte- 
grator. Both of these two schemes accomplish the same function, while the right 
scheme uses more capacitors and switches. On the left circuit, the sampling ca- 
pacitor is discharged to either Vrefp or Vrefn during the integration phase, and 
during the sampling phase the sampling capacitor is charged to the input voltage. 
During the charge and discharge cycle, the voltage change on the sampling capac- 
itor is from either Vrefp or Vrefn to Vin, which is quite large. Most importantly, 
the charge current is provided by the preceding OTA. These large signal charges 
require a high slew-rate of the OTA and consume more power. However, on the 
right circuit, the large charge current is provided by the reference voltage. The 
OTA only charges the sampling capacitor from 0 to the input voltage, which is 
smaller. So the right circuit relaxes the requirement for the preceding OTA and 
consumes less power. Thus it is chosen in this design. 

Thanking for the relatively low threshold voltage of this technology, all switches 
can be directly driven without any clock boosting stage. Local drivers are em- 
ployed to drive switches. Minimum transistor lengths are used in these drivers 
and switches to lower the power consumption. 

The quantizer used in this design is a regenerative comparator followed by a 
static latch [10]. The requirements, e.g. the offset and hysteresis, for the quantizer 
are quite loose due to the single-loop topology. Moreover, the dynamic operation 
helps to lower the power consumption as well. 

4.3. Capacitance Implementation 

All capacitances in this design are implemented by metal wall structure, shown 
in Fig. 12 [11]. This structure uses the lateral capacitance instead of the vertical 
capacitance. In ultra-deep-submicron technologies, the horizontal spaces between 
metal lines are smaller than that of vertical space, resulting high unit capacitance. 
Further more, the horizontal geometric sizes are well controlled in these tech- 
nologies. Totally 8 metal layers are used to build the capacitance in this design. 
Calculation shows the unit capacitance is around 1.7 fFlixm?, which is higher 
than that of the normal MIM capacitance in mixed-signal technologies. However, 
the matching property is worse than that of MIM capacitance. As the loop coeffi- 
cients are defined by the capacitance ratios, robust loop coefficients are chosen to 
be more tolerant to the inferred-matched capacitance. Besides, this kind of capac- 
itances is widely used in this design as decoupling capacitors around the chip. 




Figure 12: Proposed metal wall capacitance structure. 



4 . 4 . Measurement Results 

The design was implemented in a standard 90 nm pure digital CMOS technology. 
No additional technology features, e.g. low- Vj- transistors, are used in this design. 
Both the analog and digital power supply voltage are 1 V. The chip core size is 
0.42 mm x 0.42 mm only. The chip was bonded on a ceramic substrate and then 
mounted in a copper-beryllium box to shield the chip from the external interfer- 
ences. The measurement was done by feeding high quality sinusoid signals into 
the modulator and capturing the output data by a logic analyzer. Finally the col- 
lected data was processed in the Matlab. Fig. 13(a) shows the output spectrum 
of an 11 kFlz input signal. It can be seen that the peak SNR reaches 85 dB and 
the peak signal to noise plus distortion ratio (SNDR) reaches 81 dB. Fig. 13(b) 
shows the plot of the SNR and SNDR vs. the input amplitude. A dynamic range 
(DR) of 88 dB has reached. The total power consumption is 140 p,W excluding 
the power consumption of the output buffer. Table 2 summarizes the measured 
performances. 



5. Conclusions 

A low-power low- voltage E-A modulator was designed and implemented in a 
90 nm CMOS technology. Measurement results show that it is possible to design 
high-performance S-A modulators in ultra-deep-submicron CMOS technologies. 
Table 3 shows the performance comparison of recently published low- voltage low- 
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(a) Measured output spectrum of an 
1 1 kHz sinusoidal input. 



(b) Measured SNR and SNDR vs. input 
amplitude. 
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Figure 13: Measured output spectrum and SNR, SNDR vi. input amplitude. 



Table 2: Measured performance summary 



Sampling frequency 


4 


MHz 


Signal bandwidth 


20 


kHz 


Over sampling ratio 


100 




Supply voltage 


1 


V 


Analog power consumption 


130 


/uW 


Digital power consumption 


10 


jmW 


Peak SNR 


85 


dB 


Peak SNDR 


81 


dB 


DR 


88 


dB 


Reference voltage 


0.6 


V 


Active die area 


0.018 


mm^ 
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Table 3: Performance comparison 



Names and year 


Vdd [V] 


DR [dB] 


BW [kHz] 


Power [/xW] 


FOM 


Keskin,2002 [12] 


1.0 


80 


20 


5600 


5.9e-6 


Sauerbrey,2002 [13] 


0.7 


75 


8 


80 


52e-6 


Dessouky,2001 [14] 


1.0 


88 


25 


950 


275e-6 


Peluso,1998 [15] 


0.9 


77 


16 


40 


330e-6 


This work 


1.0 


88 


20 


140 


1493e-6 



power E-A modulators whose supply voltage is not higher than 1.0 V. The figure- 
of-merit (FOM) is defined as: 



FOM = 



4kT fs OR 
P 



(9) 



Where k is the Boltzmann’s constant, T is the absolute temperature, /b and P 
are the signal bandwidth and power consumption of the E-A ADC, respectively. 
The FOM is a measure of the power efficiency of a E-A ADC, taking the signal 
bandwidth and dynamic range into consideration. However, the supply voltage is 
not taken into consideration here. To make a fair comparison, all the converters 
should be working at approximately the same supply voltage. From the table it is 
seen that this work achieves the highest FOM among these E-A ADCs. 

Following the low- voltage and low-power design strategies described in sec- 
tion 3., the power consumption of the proposed E-A ADC has been effectively 
decreased. For digital parts, the smaller transistor feature size helps to decrease 
the power consumption. Clocked at 4 MHz, the total digital part of this modula- 
tor consumes 10 /xW only. Comparing to other CMOS technologies, ultra-deep- 
submicron CMOS technologies, e.g. 90 nm technology, have some advantages to 
implement the low-power low- voltage E-A modulators. 



• The rated supply voltage is low in ultra-deep-submicron CMOS technolo- 
gies. As a result, the threshold voltage of the transistor is low too, which 
is advantageous to implement low-voltage applications. No special designed 
low-voltage circuits are needed, which simplified the circuits and lower the 
power consumption. 



• Due to the low threshold voltage in ultra-deep-submicron CMOS technolo- 
gies, for SC circuits, the switch can be driven directly without the clock boost- 
ing circuits, resulting a lower power consumption. 
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• For digital part in the converter, the shorter the transistor length is, the less 
power is consumed. That is the driving force of the technology scaling-down. 

• More metal layers can be used in ultra-deep-submicron CMOS technologies. 
For mixed-signal design, these metal layers can be used to build lateral ca- 
pacitances with the high unit-capacitance. 

• Better matching properties are found in ultra-deep- submicron CMOS tech- 
nologies, which is always helpful for analog designs. 

There are also disadvantages associated with ultra-deep-submicron CMOS tech- 
nologies. 

• In ultra-deep-submicron CMOS technologies the transistor characteristics de- 
grade significantly. The intrinsic gain of the transistor is insufficient for most 
applications and the linearity is bad. And in the low-voltage environment, 
these drawbacks are getting even worse. Another problem associated with 
ultra-deep-submicron technologies is the transistor thermal noise level and 
the 1 // noise comer frequency is higher than that of deep-submicron tech- 
nologies. The impacts of the increased thermal noise and higher 1 // noise 
comer frequency should be seriously considered in many mixed-signal and 
RF designs. 

• The supply voltage is limited in ultra-deep-submicron CMOS technologies. 
No node inside the circuits can be allowed to be exposed to voltages higher 
than the rated supply voltage, which limits the usage of some special cir- 
cuits, e.g. clock boosting and voltage doublers etc. And normally the low 
supply voltage leads a low over-drive voltage of the transistor, which forces 
the transistor works in the weak to moderate inversion region and partially 
compensates the high operation speed of the technology. 

In a word, for low- voltage E-A ADC designs, moving into the 90 nm technology 
is beneficial in the term of power-consumption. The disadvantage is the linearity 
problem, which can be solved by employing some novel system topologies. By 
using special design techniques, high-performance E-A modulators can be im- 
plemented using the 90 nm technology. The proposed design and measurement 
have proved it. 
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Abstract 

DSP-based FM receivers should be designed paying special attention 
to the implementation of the A/D converter. This paper overviews 
the available solutions for these systems by addressing advantages 
and drawbacks. Large emphasis is given to an implemented 
bandpass multibit ZA modulator, deriving its specifications from 
system requirements, and motivating each design option. 
Experimental results of both a test chip and a complete receiver are 
provided as well. 



1. Introduction 

Most analog FM receiver systems use a super-heterodyne architecture, as shown in 
Fig. 1 . The RF FM signal is shifted to a first Intermediate Frequency (IF), which is 
historically fixed at 10.7 MHz, in order to properly position the image component 
of the mixer. The signal amplitude is adjusted in gain by an Automatic Gain 
Control loop (AGC), to avoid saturation in the following stages. The IF signal is 
filtered by one or more high-selectivity ceramic or SAW filters, which attenuate 
undesired neighbor channels and out-of-band energy, thus reducing the system 
dynamic requirements. In some cases, the filter implements also the channel 
selection function. 




Fig. 1 - The super-heterodyne receiver architecture 

Nowadays, in the wireless broadband communication scenario, the general trend 
aims at the software-defined radio receiver, as an answer to the request of re- 
configurable and multi-standard systems [[1],[2],[3]]. In such systems, the receiver 
is capable of modifying the digital processing depending on its configuration. The 
software-radio approach can be adopted for FM receivers in the automotive 
industry as well, where car-radios have to deal with different broadcasting 
standards for the three main areas (Europe, America and Japan), and the consumer 
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market requires advanced functions (e.g. audio and channel equalization, and 
MPEG-1 Layer 3), along with the integration with multimedia devices (e.g. CD- 
players, phone handsets, and GPS navigation). State-of-the-art sub-micron 
technologies ease the design of high-frequency circuits, which can effectively 
digitize the IF signal; mixed-signal devices can thus be implemented, sampling the 
FM signal centered at the IF, and demodulating it by means of a programmable 
digital processor. 

This work at first describes the possible architectures for an IF-sampled system, 
focusing on the advantages and the drawbacks of each possible solution; then, it 
concentrates on the definition of the specification of the selected design for the 
ADC; finally, the architecture and the circuit design of the A/D converter is 
analyzed in detail. 

2. Architectures for a DSP-based FM Receiver 

FM transmission systems are among the oldest standards in wireless 
telecommunications. The actual broadcasting FM signal is the result of an 
evolution, during which the amount of transmitted information has increased, 
though maintaining backward compatibility. As a consequence, an FM receiver 
does not present a set of full specifications as for recent digital communication 
standards (GSM, GPS, UMTS, 802.1 la/b/g, etc ...). The broadcasting FM channel 
bandwidth is about 200 kHz and its carrier signal frequency ranges in between [88 
MHz - 108 MHz]. 

As a consequence an FM receiver has to be designed trading among several target 
specifications. For instance, in crowded channel situations, the key aspect is the 
selectivity of the receiver (i.e. its capability of selecting the desired channel even in 
presence of close and powerful adjacent channels). On the other hand, in presence 
of coarsely spaced FM stations, with weak signal amplitude, the receiver 
sensitivity (i.e. its capability of processing signal with very small amplitude, with a 
low-noise electronics) is of primary importance. These two possible approaches are 
complementary: for instance high-selectivity is obtained with high-Q and noisy 
filters, which, on the other hand, limit the sensitivity. 

Recently, the possibility of realizing DSP-based FM receiver has been 
investigated, and the resulting most popular architectures are the IF solution 
[[8],[9],[13],[14],[15],[16],[17],[18],[19]] (shown in Fig. 2) and the low IF (LIF) 
solution [[20], [21]] (represented in Fig. 3). 

In both cases the RF front-end is assumed to be the same with a 10.7MHz-IF 
bandpass ceramic filter, which is intended to protect the A/D converter from strong 
interfering signals. 

In the first architecture, the signal is digitized at the 10.7 MHz IF, and additional 
filtering and downconversion is performed in the digital domain. In both cases 2A 
modulators are typically adopted, providing the required SNR only in the 200 kHz 
bandwidth of interest. 
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Fig. 2 - The digital-IF receiver architecture Fig. 3 - The digital-LIF receiver architecture 



The second architecture is the LIF implementation; it uses an additional analog 
mixer to shift the signal to a lower IF before digitization; in this case, two ADCs 
are required to implement the image-rejection relative to the 2"^ mixer. 

2.1. IF-ADC approach 

The direct digitization of the channel at IF with a single ADC presents the 
following advantages: 

• the scheme uses a single digitizer (BP2AM), while I&Q (In-Phase & 

Quadrature) signals are separated in the digital domain (by means of a 
Cordic, for instance), guaranteeing ‘infinite’ matching; 

• the image frequency is far from the signal frequency, therefore no image- 

rejection filtering is needed. 

On the other hand, the digitalization of the 10.7 MHz signal requires more 

stringent requirements for the analog blocks (opamp, switches, oscillator, etc ). 

For instance, the resulting low / s/^f ratio corresponds to have few samples for each 
signal period, giving large voltage steps between consecutive samples. This results 
in the necessity of ensuring large slew-rate capability, to avoid distortion. Similar 
arguments are valid also for the sampling clock jitter, which results in a DR 
reduction. 

Regarding the possible implementations of this kind of architecture, the main 
alternatives reported in literature are: 

• switched-capacitor (SC) technique [[8],[9],[14]-[19]]: requiring higher 

speed for the active devices (opamps) but less sensitive to jitter and 
with more stable frequency response; the eventual anti-aliasing 
filtering function is implemented by the external ceramic filter; 

• continuous-time (CT) technique [[15], [20], [21]]: characterized by smaller 

bandwidth requirements for the active devices - and so less power 
consumption - but higher sensitivity to jitter and parameter spread, 
thus requiring a tuning circuit, which may affect the DR performance. 
On the other hand they do not need any anti-aliasing filter in front of 
the ADC. 

The choice of the sampling frequency is an important parameter for the overall 
performance. According to literature, the sampling frequency can be chosen as: 

• [[14], [16], [17]]: this allows a very efficient digital downconversion 
to baseband, but it may result in a DR reduction due to the folding of 
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the odd harmonics into the signal band. The image frequency is far 
from the signal frequency, therefore, no image-rejection is needed. 

• 3< fs/fiF <4 [[8], [15]]: the linearity performance is improved with respect 

to the previous case, but the demodulation requires a digital mixer 
circuit. As before, no image-rejection filtering is needed. 

• fs/fiF = 4/3 [9]; this is an undersampling technique. It reduces the 

bandwidth for operational amplifiers, but tightens the requirements for 
the image-rejection filter. The digital demodulation is as efficient as for 
/s/A=4. 



2.2. LIF & VLIF approach 

This approach exploits the consideration that high linearity is achievable at low 
frequencies (or at large /s/^f ratios) with relaxed requirements for the analog 
blocks in the ADC. A 2"^ mixer stage can be used to additionally downconvert the 
IF signal to a lower frequency, and separate the I and Q signals, which are then 
processed by two ideally identical paths. The 2"^ mixer can be implemented with 
an efficient fully-passive structure [[20],[21]]. When the IF is downconverted to 
dc, the system is called ‘"zero-IF” (ZIF); when the signal is shifted few hundred 
kilohertz from the dc, the system is named “near-zero-IF” or “very-low-IF” 
(VLIF). 

Also these methods have both advantages and disadvantages. In the ZIF, the 
baseband signal (at dc) accuracy is corrupted by the superimposition of the dc- 
offset (which is generated by path mismatches, and local oscillator leakage), and 
of the l/^noise. This motivates preferring the VLIF systems. They however 
require a high image-rejection (IR), due to closely spaced adjacent channels, and 
image. The achievable IR is limited by the negative-to-positive frequency 
crosstalk of the in-phase and quadrature signals, determined by the path mismatch. 
The Image Rejection (IR) depends on 



the matching of the phase/quadrature 
paths, and it can approximately be 
expressed as in (2), where /S.AIA 
represents the relative gain mismatch 
between I and Q paths, and A(^ is the 
phase error in radians [10]. 



IR = 10*logio 



A 



( 1 ) 






2.3. Other implementation considerations 

In the above discussion architectural issues have been addressed. However, the 
final performance of the receiver depends also on other parameters, more related to 
the particular implementation solutions adopted for the realization of the ADC. In 
the following sub-sections some of these parameters are introduced and their 
effects on the different implementation approach are compared. In this analysis 
literature-found silicon implementations will be considered. For the IF approach. 
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both SC and CT implementations are available in literature, while for the VLIF 
only a CT solution has been proposed. 

2.3.1 ADC input impedance 

The input impedance of the ADC is of primary importance in its coupling with the 
analog tuner front-end. For CT-ADCs, it is generally not critical, provided that 
proper impedance is taken into account in designing the tuner. On the other hand, 
when an SC-ADC is adopted, the tuner output stage is loaded by a switched- 
capacitor, whose value may be up to 8-10 pF. The resulting large (about 10 mA) 
impulsive current may decrease the front-end performance, unless the SC-ADC 
input stage be carefully designed. 

2.3.2 Frequency response accuracy and tuning system 

The accuracy of the time-constant of a CT circuit depends on the poor precision of 
the integrated components (with variation up to ±40% with respect to its nominal 
value). This motivates the use of tuning systems capable of controlling the 
frequency response of the CT-2AM [[15], [34], [35]]. This is more necessary for the 
BP case (IF-ADC), while for the LP case (VLIF-ADC) is less important, but 
necessary as well to achieve large DR by implementing an aggressive NTF. As a 
drawback, state-of-the-art tuning solutions may reduce the DR due to their 
interaction with the CT-2AM. 

On the other hand, SC2!AM are less sensitive to component spread, and thus at the 
first order the tuning can be avoided. As a general consideration, in both cases the 
use of a low-order BP2IAM limits the useful bandwidth, thus reducing the system 
robustness. To fit the bandwidth even for these cases, it may be necessary to adjust 
the frequency response also in SC systems and, on the other hand, it may disable 
the adoption of CT systems. 

2.3.3 Anti- Alias Filter 

Before digitization, an anti-alias filter is generally required to remove the image. 
However, in CT-ADCs the CT-loop itself implements this function. Similarly, in a 
digital-IF system, the IF ceramic filter removes the alias, allowing using an SC- 
ADC without a dedicated image-rejection filter. 

2. 3. 4 Sampling clock jitter 

According to the study of Tao e/ a/. [[11]] the sampling clock jitter sets an upper 
bound on the achievable dynamic range in both SC and CT converters. Eq.s (2), 
(3), and (4) give this upper bound for the case of SC, CT with a Non-Retum-to- 
Zero DAC and CT with a Return- to-Zero DAC, respectively. Fig. 4 reports the 
dependence of the maximum DR vs. the clock jitter for the three above cases, 
under typical conditions. 

2. 3. 5 Comments and Results 

A summary of the characteristics of three literature found solutions, an SC and a 
CT for the IF approach, and a CT for the VLIF approach is reported in Table I. Up 
and down arrows indicate respectively advantages and weak points. 
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Table I 

Characteristics of some ADC Implementations for FM receiver (DR>78dB) 





IF-ADC (BP2AM) 


VLIF-ADC (LP2AM) 


Implementation 


CT[15] 


SC [8] 


CT [21] 






10.7 MHz 


10.7 MHz 


300 kHz 






40 Mhz 


37.05 MHz 


41.6 MHz 




2AM Order & Structure 


Single-bit 
e* order BP 


Multi-bit 
2"" order BP 


Single-bit 
5’" order LP 




Power consumption 


Higher D- 


Higher 


Lower 




Additional Block Req.ments 


Anti-Aliasing Filter 


No t 


Yes 


No 




Image Rejection Filter 


No t 


No II 


Yes 




Tuning circuit 


Yes D- 


No/Y es 


Yes 




Sensitivity 


Clock Jitter 


Yes 


No ff 


Yes 


11 


Channel mismatch 


No t 


No ff 


Yes 


11 


(Gain and phase) 


Tuning system 


Yes 


No ff 


Yes 


11 


Cross Coupling Noise Sensitivity 


High t 


Low H 


High 


ff 



3. A DSP-based FM receiver 

According to the above discussion, a DSP-based FM receiver has been developed 
[8]. The partitioning of the receiver is shovv^n in Fig. 5. The car-radio system, in 
which the IF processor is embedded, operates under severe field conditions but it is 
nevertheless required to ensure high-fidelity audio quality. The chip photo is 
shown in Fig. 6. The device has been realized in 0.18 fxm CMOS technology, 
using a die area of 15.2 mm^. For this device, an SC IF-ADC has been chosen for 
its intrinsic robustness to process parameters spread, and to the noise of the 
environment. The main drawback of this choice is the power consumption larger 
than for the VLIF approach. However, most dissipation in a car-radio system is due 





389 



to the digital processing (hundreds milliwatts), and this makes the large power 
consumption of the SC-IFADC acceptable. 




Fig. 5- Single-chip IF processor architecture. 




Fig. 6- IF processor chip photograph. 



4. IF-ADC dynamic range design considerations 

Assessing the dynamic range (DR) of the ADC converter is a key issue in the 
design of the FM radio system, since it does not only determine the sensitivity of 
the receiver but it also establishes the specifications for the analog tuner front-end. 
Throughout this work, the dynamic range is defined as the ratio of the full-scale 
sinusoidal input signal power, to that which results in an SNR of zero over the 
band of interest. The case study is the superheterodyne receiver shown in Fig. 2. 
In the following, some design equations are derived, expressed in decibels, for 
assessing the AGC characteristic and the necessary ADC dynamic range, under the 
assumption of ideal noise-free tuner- front end. 

As an effect of the AGC loop, the antenna input signal is linearly amplified by a 
gain, until its level is below a threshold; otherwise it is saturated. In the latter 
condition, the input signal to the A/D converter is chosen to be the maximum 
allowed in order to have the ADC still linearly operating. It results that: 

f^AGC + Gtot = 

where, Fagc is the AGC threshold, Gtot is the gain from the antenna to the ADC 
converter input, and lADmax is the highest input signal of the ADC for which 
linearity in the conversion is guaranteed. 

Other design considerations can be derived in presence of both a desired and a 
neighbor channel, whose level is higher than the AGC threshold; the AGC 
attenuates both of them by the same amount, corresponding to the difference 
between the attenuated undesired signal level and the AGC threshold. 

Anyhow, the AGC-adjusted desired signal level has to be higher than the minimum 
acceptable antenna level. This situation, illustrated in Fig. 7, can be expressed by : 
Vd - [Ku - Fagc - A:f (a/o )] ^ (6) 
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where Fu is the undesired signal level, Fd is the desired signal level, ^CF(4^) is the 
attenuation of the ceramic filter evaluated atyip+A/, relative to IF, and ^ant^^in 

the minimum acceptable signal level from the antenna, for which the minimum 
acceptable SNR is achieved on the audio signal. 





Fig. 7 - Desired/undesired signal Fig. 8 - Signal levels for the receiver 

relations used in the definition of the AGC 

From equation (3), considering the case limit, it results that: 

=^'AGC + A:F(A/b)-f^anVin (7) 

This represents the fact that the maximum difference between the undesired and 
the desired signal level cannot be larger than the difference between the AGC 
threshold, at the undesired signal frequency, and the minimum acceptable antenna 
signal level. 

After having characterized the front-end using the previous equations, it is possible 
to derive the required ADC dynamic range; referring to the signal level diagram 
shown in Fig. 8, it can be inferred that: 

+ Gjo, = (FaDps - DR) + (SNR^j„ - Gdem) (8) 

where is the ADC full-scale signal level, DR is the dynamic range of the 

ADC, SNRrnm is the minimum acceptable audio SNR, evaluated in a 15 kHz 
bandwidth for FM, and Gdem is the demodulation gain defined as the ratio between 
the signal-to-noise value at the ADC output and the audio SNR. 

Eq. (5) expresses the condition that the minimum acceptable antenna signal level is 
amplified in order to reach a level higher than the noise floor, resulting from the 
A/D conversion, by the minimum acceptable SNR at the output of the A/D 
converter. 

The combined results from equations (2), (4) and (5) are shown in Table II. For an 
FM modulated signal with a 40-kHz frequency deviation, the demodulation gain, 
considering also 50p,s-deemphasis filtering, is about 26 dB, provided that the 
demodulator is operating above threshold. As a practical critical condition, a 200 
kHz-off neighbor channel can be 75 dB higher with respect to the desired signal. 
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Typical ceramic filter attenuation at this frequency is 20 dB, relative to IF. The 
ADC full scale and its maximum input signal level guaranteeing linearity are 
chosen based on design experience. It results that the starting point of the AGC is 
55 dBp,V while the antenna-to-IF gain is 59 dB; the dynamic range required for the 
ADC to meet the system specifications results in 78 dB. 

Simulations of the described radio receiver have been used to validate the proposed 
design methodology. Simulated audio SNR, evaluated over a bandwidth of 1 5 kHz, 
is reported in Fig. 9, in conjunction with experimental results of the implemented 
system. The simulated curve saturates to an unpractical 97 dB of audio SNR, while 
the measured is 30 dB lower; this can be explained by the fact that the tuner VCO 
phase noise was not modeled in the simulator, and this results to be the limiting 
factor in the saturation region. However, the simulation model is appropriate in the 
linear region. 



Table II - Determination of ADC DR 






Description 


Symbol 


Unit 


Value 


Specifications 


Maximum undesired/desired ratio for 




dB 


75 


SNR,,„ 


ADC maximum level input for linearity 




dBfiV 


114 


Minimum acceptable antenna signal 




dB^iV 


0 


level 


Minimum audio SNR 


SNR^i„ 


dB 


40 


Demodulation gain 


Cdem 


dB 


26 


Ceramic filter relative attenuation at Af 


AfCVo) 


dB 


20 


ADC full scale 




dBM-V 


123 


Design Variables 


AGC threshold 


Fagc 


dB 


55 


Gain from antenna to ADC input 


Gtot 


dB 


59 


ADC dynamic range 


DR 


dB 


78 



In Fig. 10, the power spectra of the signals at various stages of the system prove 
that the AGC characteristic, derived from equations (2) and (4), allows handling 
undesired signals higher than the desired up to 75 dB. The interferer does not 
decrease the audio SNR since the desired is still recognizable at the ADC output, 
as imposed by equation (4), and because the undesired is completely removed by 
the channel filtering. 
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Fig. 9 - Audio SNR evaluated over 15 
kHz bandwidth, for IkHz mono signal 
using 40 kHz frequency deviation, in 
function of the tuner input signal level. 
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Fig. 10 - Simulated power: (A) ideal tuner 
input signal, (B) signal at ADC input, (C) signal 
after A/D conversion, (D) complex baseband 
signal. Desired singnal has 40 kHz deviation, 
while the interferer is an unmodulated carrier. 



5. The Bandpass ZA Modulator 

The technological trend toward scaled-down device sizes is a key issue for the 
realization of single-chip systems, which include both analog and digital parts. 
While for the digital blocks the scaled down technologies allow a reduction in area 
and power consumption, for the analog sections the relatively poor analog 
performance of the scaled MOS devices results in a considerable design challenge, 
which makes it difficult to achieve the performance previously obtained with 
larger minimum size device. This motivates the research of novel architectures for 

the mixed signal blocks (like ADCs, DACs, etc ), which take advantage of the 

improved digital circuitry in order to compensate for the poorer analog circuitry. 
The IF-ADC, embedded in the presented DSP-based FM receiver, is a BandPass 
2A Modulator (BPSAM). The following key choices characterize the proposed 
device: 

• use of a SC implementation to guarantee performance robustness versus 

clock jitter; 

• use of a 2'^^-order modulator (the minimum order for a pure BP solution) to 

reduce the amount of analog circuitry, and so the analog penalty; 

• use of a single opamp for the realization of the 2”^^-order loop filter 

(resonator), to reduce power consumption 

• use of a multibit (3 3 -level) quantizer to increase the relatively low 

Dynamic Range (DR) of the 2nd-order modulator and, in addition, to 
improve system stability 

• use of a bandpass Dynamic Element Matching (DEM) technique to 

linearize the multibit feedback DAC; 

• use of a self-calibrating system to increase the accuracy of the Noise 

Transfer Function (NTF) notch bandwidth position (fo) and depth (0, 
in order to avoid DR losses; 
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• use of an input switched-buffer to reduce input impulsive current sunk 
from the previous block driving the BPSAM. 

Further details on these choices are given in the following sections. 

5.1. The BP2AM Architecture 

The adopted BP2AM architecture is shown in Fig. 11. The fundamental blocks 
are: the input switched buffer, the bandpass loop filter, the 3 3 -level quantizer, and 
the multibit feedback DAC with DEM circuitry. In addition a dither signal is 
generated and injected in the signal path. Finally the NTF /o and Q are adjusted 
with two dedicated control systems. 

In the architecture of the FM receiver of Fig. 2, the BP2AM input signal is the 
output signal either of the analog mixer or of the ceramic. The BP2AM input 
switched-capacitor, being some pFs large under low-noise and robust design 
considerations, generates large input current spikes, which are detrimental to the 
performance of the previous block. For this reason a high-performance switched- 
buffer has been conceived, that significantly reduces the amplitude of current 
spikes at the input. 




Fig. 11 - The adopted overall BPIAM Frequency [MHz] 



architecture Fig. 12 - Output spectrum with NTF error 

Regarding the BP2AM, a 2"'^-order structure has been adopted, since it presents the 
minimum amount of analog components for the realization of a "true” bandpass 
system. Previous solutions use single-bit higher order modulators to achieve 
similar performances, however they suffer from the stability issue, which force 
reductions in the DR. 

Thus, a 33-level quantizer is adopted. The multibit quantizer gives 6dB 
SNR-improvement for each extra bit, and increases the system stability. This is 
also important in order to implement a more aggressive noise shaping, which 
improves the accuracy. In addition, the multibit quantizer linear behavior reduces 
the spurious-tones typically present in F^-order structures. However, for very low- 
level signals, the multibit quantizer operates as a single bit quantizer. This results 
in the relatively poor performance of the 2”"^-order BP2AM (which corresponds to 
a F^-order single-bit low-pass modulator). To reduce this weakness, a digital dither 
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signal at /s/4, with amplitude of a single LSB, is added to the quantizer output. 
Such a dither is necessary, since the noise shaping could be lost at low signal 
levels, resulting in a significant DR loss. 

The main problem for using a 3 3 -level quantizer is the linearity requirement of the 
feedback DAC. Since it is in the feedback loop, its accuracy needs to be at least as 
good as the converter resolution in order not to deteriorate the BPS AM 
performance. Several solutions to relax the linearity requirements have been 
proposed in literature. In this design, the linearity is ensured by means of a DEM 
algorithm, which implements a 2”*^-order bandpass noise shaping algorithm. 

On the other hand, the use of 2”^-order loop filter gives a NTF notch bandwidth 
(~300kHz), which is slightly larger than the FM signal bandwidth (~200kHz) 
centered at 10.7MHz. This arrangement reduces the DR losses, which occur with a 
misadjusted notch location or bandwidth (as shown in Fig. 12), which can be due 
for instance to a 0.5% capacitor mismatch, to a 200MHz opamp bandwidth, and to 
a gain lower than 80dB. These requirements cannot be relaxed in order to achieve 
high yield in mass-production. Thus, performance robustness is guaranteed also by 
means of two calibration algorithms, which control the frequency (fo) and the 
quality factor (Q) of the NTF zeros. 

5.2. Switched buffer 

The switched-capacitor (SC) circuits in the sampling structure of Fig. 13. a (where 
a superlinear switch [12] is assumed to be used), due to their sampled-data nature, 
operate with large transients, which correspond to large input current spikes. The 
amplitude of these current spikes depends on the size of the capacitor, on the 
conductance (1/Ron) of the switches, and on the signal amplitude. 




switch 

(a) (b) (c) 

Fig. 13 - Two possible input branch solutions 

The plot at the top of Fig. 14. a shows the input current required by the BP2AM 
(with a 5.8 pF input capacitance) for full-scale input signal amplitude (4 Vpp) with 
a standard SC input branch of Fig. 13. a. The input current spikes, even in the 
presence of the real series switches, are large up to 10 mA and they may 
deteriorate the performance of the previous stage that has to source them. A more 
detailed view is given in Fig. 14.b where the upper plot shows the current 
amplitude during the different clock phases. A first level solution to reduce these 
spike effects is to insert a buffer, which is always connected to the previous stage 
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and drives the input SC-branch. In this way, the previous stage is loaded with 
constant (i.e. non-switched) impedance and, thus, it is not required to source 
current spikes. This solution is expensive in terms of area and power consumption. 
In fact, the input buffer must exhibit high-quality performance in order not to 
deteriorate the modulator performance in terms of frequency response, noise, and 
linearity. Therefore, the buffer has to exhibit a large gain and unity-gain bandwidth 
in order to guarantee an accurate sampling operation. In addition, the contribution 
of the input buffer to the sampled voltage in terms of noise and distortion must be 
minimized. This can be achieved at the cost of large area and power consumption. 
It should be noted that bandwidth, noise, and linearity are not critical if the 
standard un-buffered passive SC structure of Fig. 13. a is used. 




Time 



(a) overall waveform 



Time 

(b) detail of the waveform 



Fig. 14 - Comparison of the input current for the two solutions 



In the proposed design, a switched buffer [[8]], shown in Fig. 13.b, has been 
adopted. The sampling phase (1) of the standard scheme of Fig. 13. a is divided in 
two parts as shown in in Fig. 13. c (la and lb): 

• in the first part (la) an active buffer (always connected to the input 

voltage) pre-charges the sampling capacitor Cs to a voltage close to the 
input voltage and, in particular, it reduces the level of the input current 
spikes, which are larger during the initial part of the sampling phase; 

• in the second part (lb), the active buffer is disconnected from sampling 

capacitor and the final settling is achieved with a standard passive 
switch (superlinear in this implementation), which ensures high 
linearity, low noise and large bandwidth. 

This solution is advantageous for the following reasons: 

• the active buffer is not required to have high-speed response, since the 

final value is fixed by the switch performance; 

• the accuracy of the transition between the two phases la and lb of the 

switched buffer is not critical: thus their generation can be very simple 
and with a small overhead; 
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• the capacitor is pre-charged to a value close to the final one. This is more 

efficient than in [[31]], in which the capacitor is pre-charged to the 
previous sample. This approach could be not only ineffective but also 
dangerous in our case due to the small ratio fs/fo\ 

• the level of the current spikes is reduced by 85%. 

A further optimization [[29]] of the complete switched-buffer is possible in order 
to pre-charge the capacitance internal to the superlinear switch with the same 
buffer, avoiding the supplementary load, which is typically applied to the input 
signal source. 

The effectiveness of the proposed solution can be appreciated from Fig. 14.a and 
Fig. 14.b, where it is compared to the standard solution. For the switched buffer a 
single stage CMOS opamp with 6 mA current consumption is adopted. The bottom 
plot of Fig. 14. a shows the impulsive input current for the switched-buffer for the 
overall input waveform. The current spikes are limited to a maximum value of 
about 1 .2 mA, which is a factor of about 8 times lower than the standard solution 
(the input current is 0.19 mArms and 0.76 mArms respectively, for the solution with 
and without the switched-buffer). A detailed view of this behavior is given in Fig. 
14.b. It can be observed that for each phase there are two current spikes 
corresponding to the edge of the phases la and lb. At the begining of phase la, the 
spike is due to the current necessary for the buffer capacitance, while the current 
spike at the beginning of phase lb is due to the charging of the sampling capacitor. 
The above simulation shows the achievable large reduction of the input current 
spikes, which allows different stages to be coupled together without deteriorating 
their stand-alone performance. 

5.3. Loop Filter Design 

The 2”^-order BP loop filter is realized using SC technique, operating with a 37.05 
MHz sampling frequency. This value has been chosen since it is a multiple of 
frequencies available in the system (the 57 kHz RDS carrier and the 50 kHz VCO 
tuning frequency step). Fig. 15 and Fig. 16 show the architecture and the SC 
implementation of the BP2AM, respectively. 

A single time-shared opamp for 2”^ order resonator is used. This solution results in 
a 35-40% power saving with respect to the two-opamps solution. This is also 
slightly area effective since the opamp area is much smaller than the capacitor area 
(see chip photo in Fig. 18). On the other hand, in the single opamp solution, two 
critical switches have been added in series with the two integration capacitors (Cb, 
Cd in Fig. 16). These switches are connected to the output node in order not to 
corrupt the charge conservation at the virtual ground node. However, according to 
this, their conductances depend on the signal amplitude. Thus, super-linear 
switches are used in order to prevent degradation of the SNDR performance. With 
the single opamp solution the layout design requires attention to avoid path 
crossing between the different connections to the virtual ground, which would 
result in parasitic charge transfers during the two different integration phases. 
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A linearity improvement (of about lOdB in IMD) has been experimentally obtained 
by sampling the input signal with capacitor Cini toward ground instead than 
towards a virtual ground, whose accuracy depends on the opamp bandwidth 
performance. 



fo-control 
C?-control : 



O-control 




Fig. 15 - Modulator architecture 



Fig. 16 - Single-opamp structure SC 
implementations of the BPZAM 



The BP2AM implements the following loop filter transfer function (H), the Signal 
Transfer Function (STF) and the Noise Transfer Function (NTF), under the 
assumption of ideal performance for the analog parts of the BP2AM: 

-1 

( 9 ) 






z"^ + 0.4827 -z"^+l 



5TF(z~*) = 



-1 



NTF{z~^) = 



0.4827 -z'Ul 
z~^ +0.4827 -z~^ +1 

0.4827 -z'^ + l 



( 10 ) 

( 11 ) 



5.4. The fo and Q Calibration Block Design 

In a BP2AM several non-idealities (mainly the capacitor mismatch and the opamp 
finite gain and bandwidth) cause a displacement of the NTF zeros from their 
nominal position. Therefore, even though an SC implementation has been used, a 
calibration system for the positioning of the NTF zeros has to be adopted [[28]] to 
guarantee performance robustness even in production. The nature of the mentioned 
sources of inaccuracy with respect to the required precision makes ineffective the 
use of master-slave calibration structures [[34], [35]] (which are based on 
component matching). Instead a two-step self-calibration approach has been used 
in the device: 

• in the first step the resonator quality factor (Q) is increased in order to 

guarantee a certain depth in the NTF notch 

• in the second step the notch frequency (fo) is adjusted at the center of the 

signal band. 

Notice that the Q-tuning is not strictly necessary for the modulator performance 
(thermal noise is dominating quantization noise in the signal band) but it is needed 
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to guarantee the proper working of the /o-tuning algorithm. The calibration 
algorithms are based on the fact that the nominal loop filter transfer function H (2 ') 
(and, as a consequence, the loop filter nominal transient response to an input 
impulse) is known a priori. So the modulator structure can be reconfigured in 
order to measure some specific parameters in its transient response, which are 
correlated to its frequency response. In both the calibration algorithms, the BP2AM 
is reconfigured by turning off the feedback DAC. A very-simple training signal (a 
one clock cycle impulse) is applied at the input of the loop filter (operating without 
the feedback DAC, i.e. as a resonator). The number of the zero-crossings with 
respect to a given threshold at the output of the loop filter is counted in the digital 
part and compared to its nominal value. This solution is convenient for two 
reasons: for the natural presence of the quantizer at the loop filter output and for 
the simplicity of the training signal. The elaboration of the number of zero- 
crossings allows to adjust the loop filter frequency response by setting the value of 
parameters Cci and Cc 2 (responsible for g-factor) and Cc (responsible for fo). 

The ^-factor calibration block is designed to maintain the g-factor at a level above 
a certain value. This is done by adding a SC positive feedback (Cci and Cc 2 ) on 
both integrators. This positive feedback is accurately controlled by the algorithm, 
in order to avoid system instability. In this way the algorithm prevents the g- factor 
from exceeding a given value. The capacitor ratios Cci/Ceand Cc2/Cd have to be 
very small (in the range of 0.016-0.8), and they cannot be realized directly using a 
single capacitor. In fact, Cci and Cc 2 have to range between fractions of one fF to 
several fFs. Since in the adopted technology the minimum feasible capacitor is 50 
fF, a digitally programmable T-network has been adopted. 

The fo calibration block is used to move the center frequency by changing the 
resonator feedback capacitor Cc, which is realized with a 15-capacitor array 
controlled by the f control algorithm, with a minimum step of 21 fF. 

These calibration procedures allow the NTF notch frequency fo to be set with a 
resolution of 40 kHz, which meets the application requirements. The digital 
calibration circuitry requires a maximum time of 28,800 time slots (i.e. less than 
0.8 ms) and occupies an area as small as about 3.5 kgates in the 0.18 pm 
technology. 

Such a proposed self-calibration procedure disables the signal processing, so it has 
to be executed according to the timing of the overall system in which the BP2AM 
is embedded. 



5.5. Analog Block Design 

The minimum feasible capacitor size for a robust design in the adopted technology 
is 50fF, which has been applied as the minimum capacitor in the fo and Q 
calibration circuits. As a consequence, the BPS AM capacitors result to be the 
following: CA=2pF, CB=1.12pF, Cc=7.94pF, CD=5.7pF, Cci=3.2pF, Cc2=3.2pF, 
Cini=3.19pF. The unitary-element capacitor in the feedback DAC is lOOfF. Notice 
that, in order to optimize the opamp performance, the capacitive load and the 
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feedback factor have been maintained constant in both the opamp operating 
phases. With the above values, the opamp load is up to 15pF. This makes the 
thermal noise of the switches (kT/C) negligible with respect to the opamp noise. 
The capacitive loading and the following considerations make the design of the 
opamp of key importance. The opamp has to exhibit: 

• low noise (to match the DR requirement) 

• high speed (to settle in half a period of the sampling frequency) 

• high gain (to reduce the integrator phase error) 

• high slew rate (to reduce distortion and to properly ’jump' in the single 

opamp configuration) 

• high output swing (4Vpp differentials from a 3.3V power supply). 

The class AB single stage output boosting structure, shown in Fig. 17 [[32], [33]] 
has been used. The input stage devices operate in the sub-threshold region in order 
to achieve a large transconductance. An eventual large offset may be sustained by 
the bandpass application. 




Fig. 17 -The opamp schematic 18 - Chip photograph 



The typical open loop opamp performance (taking into account the feedback 
factor) is: DC-gain=112 dB, unity-gain bandwidth=350 MHz, current 
consumption=17.6 mA. This nominal opamp performance does not require any 
calibration, however in technological worst case conditions it still requires the use 
of fo and Q tuning circuits. 

The 33-level quantizer is realized with a flash architecture. The number of levels is 
relatively high in order to increase linear range and DR. The case of the digital 
output word with all O's corresponds to no unit elements being selected and does 
not require any DEM algorithm. Thus the DEM is applied to a 32 levels. Each 
level is selected by a comparator, which is the cascade of an auto-zero SC 
amplifier, to avoid kick-back noise, and a latch. The reference levels are 
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implemented with two resistive strings. Finally, the 33-level quantizer total current 
consumption is 4.5 mA from a 3.3 V supply. 

5.6. Experimental results 

The proposed BPS AMs have been integrated in a 0.18 \im CMOS technology, 
which features two oxide thicknesses. This makes available two MOS devices: a 
0.18 pm CMOS with a 1.8 V maximum supply and a 0.35 pm CMOS with a 3.3 V 
maximum supply. The technology features 6 metal levels and high-linearity metal- 
metal capacitor. The device has been realized in a test-chip, where the clock is 
supplied by an external signal generator, and also embedded within a complete 
fully-integrated mixed-signal FM receiver. In the latter case the clock is generated 
on-chip by a 3*^^ overtone quartz oscillator, which guarantees a better jitter 
performance (lower than 10 ps„ns)- Fig. 18 shows the chip photograph of the 
device in the stand-alone version. The die active area is about Imm^ In the photo 
the different parts of the device have been indicated. The large portion of the area 
is dedicated for capacitors and calibration blocks, which makes the opamp area 
relatively negligible. In the realized device the current consumption of the analog 
part is 22 mA from 3.3 V, while the digital part consumes 8.5 mA from 1.8 V. In 
addition, the eventual use of the switched-buffer requires a 12 mA additional 
current. 

The experimental results have been obtained with a dither signal, which can be 
clearly seen in the output spectrum at fs/4. The effect of the dither is a DR 
improvement of about 2dB with respect to the case when the dither is tumed-off. 
Fig. 19 shows the output spectrum for a -20dBFS input signal in the passband. 
Similar performance has been obtained for the test chip and in the complete 
receiver. In addition, in the receiver where the switched-buffer can be enabled, 
similar performance is also achieved with and without the switched-buffer, which 
is then demonstrated not to be the limiting block. The corresponding SNR vs. input 
amplitude is shown in Fig. 20 for the FM (200 kHz) bandwidth. For all of the 
versions a considerable DR and SNRpeak are achieved. A minimum SNRpeak of 67 
dB is performed in the test-chip version, while for the receiver version it is larger 
than 70 dB, and reaches 72 dB activating the switched-buffer. 



Frequency (MHz] 

Fig. 19 - Output spectrum @-20 dBFS input pig. 20 - SNR vs. input signal amplitude 
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Frequency [MHz] Frequency [MHz] 

Fig, 21 - IMD measurement Fig. 22 - IMD measurement with DEM algorithm 

turned off 



The DR is better for the receiver version (a minimum value of 78 dB is achieved in 
production, with some samples featuring up to 81 dB), in spite of the noisy 
environment, due to the better jitter of the on-chip quartz oscillator. This indicates 
that the external clock jitter is the limiting factor of the test-chip version. This is 
also valid for the SNR-peak as shown in the magnified inset within Fig. 20. 
Regarding the effect of the switched-buffer, it results in a 1 dB DR loss, which is 
negligible for the application. On the other hand, the switched-buffer becomes 
effective for large signals, i.e. it gives a IdB SNR-peak improvement, as can be 
seen in the magnified inset. If the inputs are shorted on-chip (mute state), the DR 
(mute-DR) moves to 80 dB for the FM band and 95 dB for the AM band. 
Moreover, a DR improvement has been observed by reducing the sampling 
frequency. Using /=35 MHz the mute-DR is larger than 95 dB (for FM band) and 
this is within IdB of the MATLAB simulated value. For this reason, the limiting 
factor could be the opamp finite speed of response (bandwidth and/or slew-rate). 
The linearity of the modulator has been evaluated with a two-tone intermodulation 
(IMD) test 0^7=10.71 MHz,/2=10.77 MHz). For two tones at -1 1 dBFS, the IMD is 
about -65 dBc (as shown in Fig. 21). 

The effect of the DEM can be 
seen in the IMD measurement. 

The output spectrum with the 
DEM algorithm turned off is 
given in Fig. 22, where a 
larger IMD can be observed. 

The DEM gives an IMD 
improvement of about 19 dB. 

Table III summarises features 
and performance. 

Finally the effect of the 
calibration system can be seen 
in Fig. 23. Fig. 23 - Output spectrum before and after calibration 




Frequency [MHz] 
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The curve on top refers to the uncalibrated case: an error in the notch frequency of 
about 150kHz is present (>1% error). The bottom curve corresponds to the 
calibrated case. The notch frequency is centered at its nominal value and this gives 
a 3dB SNR improvement. This demonstrates that even in the presence of 
significant technology spread and opamp performance loss, the BP2AM exhibits 
the maximum achievable performance in the signal band. 

6. Conclusions 

A 2'^'^-order multibit BP2AM has been realized both in a stand-alone test-chip and 
also embedded in a complete fully-integrated mixed-signal FM receiver which 
generates a noisy environment. In the final application solution (the receiver 
version with the switched-buffer on), the modulator features 78dB DR, 72dB peak- 
SNR within a 200kHz bandwidth (FM bandwidth) with a 84 mW power 
consumption. The IMD for a -lldBFS two tones test is about -65dBc. These 
experimental results demonstrate the validity of the proposed design especially in 
terms of performance robustness, since they have been achieved when the BP2AM 
is embedded in a complete FM receiver. 

Table III - BP2AM Performance Summary 



Parameter Value 



fjfo 


37.05MHZ/1 0.7MHz 


Signal bandwidth 


200kHz (FM) 


OSR 


92.6 (FM) 


Architecture 


Single-loop 


Modulator order 




Input full-scale 


4Vpp 


Technology 


CMOS 

0.18/ym @ 1.8V 
0.35/ym @ 3.3V 


Active die area 


1mm^ 


Test chip (FM band) 


DR = 76 dB 


Switched-buffer OFF 


IMD (-lldBFS) =-65 dBc 


Full receiver (FM band) 


DR = 78 dB 


Switched-buffer OFF 


IMD (-lldBFS) = -62 dBc 


Full receiver (FM band) 


DR = 78 dB 


Switched-buffer ON 


IMD (-lldBFS) = -65 dBc 


Mute-DR 


80dB (FM bandwidth) 
95dB (AM bandwidth) 


Power consumption 


88 mW 
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